STATISTICS 


A BEGINNING 


Roy R. Kuebler 


University of North Carolina 
at Chapel Hill 


and 
Harry Smith, Jr. 


Mount Sinai School of Medicine 
The City University of New York 




JOHN WILEY & SONS, Inc. 


New York / London / Sydney / Toronto 




Copyright © 1976 by John Wiley & Sons, Inc. 
All rights reserved. Published simultaneously in Canada.


Reproduction or translation of any part of this work beyond 
that permitted by Sections 107 or 108 of the 1976 United States 


Library of Congress Cataloging in Publication Data 
Kuebler, Roy Raymond, 1911- 




HA29KNA 519.5 7535717 
ISBN 0-471-50028-0 


Printed in the United States of America 
1098765 


Preface 


This book is intended to introduce the well-tempered person to the basic 
notions and processes of probability and statistics as they apply to analyzing 
data and drawing conclusions therefrom. To be well tempered, in our view, is 
mostly a matter of having some modest interest in the subject, a bit of industry 
and patience, and a reasonable recollection of early high-school mathematics. 

These days we all are besieged with numerical data and numerical arguments 
seeking to influence us one way or another. The cynic in us says sarcastically 
that “you can prove anything with statistics.” The honest truth is that you can
prove absolutely nothing with statistics. You can calculate odds, and you can
make decisions in chancy situations with some specified control over the risk of
error, and you can talk back to the statisticians. We think that these are
worthwhile aims, and that they are attainable to a decent practical degree
without elaborate training. Thus we hope for an audience of students from late 
high school up, technicians in and out of technical institutes, and businessmen 
of all varieties, indeed the general public. 

The book is elementary in that it begins at the beginning of the subject and 
deals with just the basics. It is elementary too in mathematical level—no
calculus, no set theory, and no complicated derivations. But it is a mathematics
book, written by two people who are convinced that the grasp of even the most 
rudimentary mathematical idea requires practice of the do-it-yourself kind. So 
there are many examples showing mathematical details and many exercises for 
the reader to work on. 

After much experience in studying and teaching statistics, and working as 
consultants with operational, management, and research workers, the authors 
have been led to try their hand at setting down an introduction to statistical 
analysis that is at once elementary and precise, intuitively motivated but also 
theoretically sound. It is a manual that gives a first level of statistical compe- 
tence based on mathematical ideas and procedures. Though elementary and 
thus incomplete for advanced study, nothing in the book will have to be 
“unlearned” if and when the reader goes on to higher levels. Our aim may well
have exceeded our grasp, but we hope for that corrective aid that comes from
irate readers who will not suffer in silence. 

To all of the people with whom we have worked in statistics classes and 
statistics consultations we owe a debt of thanks for asking the questions and 
testing the answers that we have tried to organize in this book. To all of our 
friends who have given encouragement and advice we are indebted. High on 
the list must be Mrs. Doris Smith, who lived through it all and read every word, 
with editorial pencil in hand, to give us the critical comments of the general 
reader. The contents of Tables A-1, A-4, A-5 were produced by the computing 
group of the Department of Biostatistics, University of North Carolina at 
Chapel Hill. To Mrs. Karen Wendt and Mrs. Delores Gold we are most 
grateful for patience and skill beyond the call of duty in translating our 
handwriting into a typewritten manuscript ready for the printer. All of these
good people deserve a share of credit for any success our efforts meet. The 
failures and errors that you find must of course be charged completely to the 
authors. 

Roy R. Kuebler
Harry Smith, Jr.

CHAPEL HILL, NORTH CAROLINA
NEW YORK, NEW YORK
JULY 1976


Note: A manual for the teacher and a workbook for the student are available 
to supplement the use of this text. 


Contents


1—DATA FOR STATISTICS

1.1 Introduction
    Exercises
1.2 Data on some Mecca Community College students
    Exercises
1.3 Data classification
    A. Discrete data
       Nominal observations
       Ordinal observations
    B. Continuous data
       Interval-scale data
       Ratio-scale data
    Exercises
    Summary exercises


2—SUMMARIZING DATA GRAPHICALLY

2.1 Introduction
2.2 Tabular summaries
    Exercises
2.3 Graphical presentation
    A. Graphical methods for nominal and ordinal data
       Bar chart
       Pie chart
       Exercises
    B. Graphical methods for interval and ratio scale data
       Histogram
       Frequency polygon
       Cumulative frequency polygon
       Cumulative percentage polygon
       Exercises


3—SUMMARIZING DATA NUMERICALLY

3.1 Introduction
3.2 Measures of centrality
    A. Arithmetic mean
       Example 3.2.1
       (Arithmetic) mean summary
    B. Median
       Example 3.2.2
       Median summary
    C. Midrange
       Midrange summary
    D. Mode
       Example 3.2.3
       Mode summary
    E. Other measures of centrality
    Review illustration: Example 3.2.4
    Discussion of review illustration
    Exercises
3.3 Measures of variability
    A. Range
       Example 3.3.1
    B. Variance (s²)
       Example 3.3.2
       Intuitive explanation of degrees of freedom (here, n − 1)
    C. Standard deviation
       Example 3.3.3
    D. Coefficient of variation (C.V.)
       Example 3.3.4
    Review illustration: Example 3.3.5
3.4 Some comments on terminology and computation
    Exercises
3.5 Some comments on looking summary statistics in the eye
    Summary exercises


4—STATISTICS AND CHANCE

4.1 Description and inference in statistics
4.2 Definition of probability
    Example 4.2.1
    Example 4.2.2
    Example 4.2.3
    Example 4.2.4
    Example 4.2.5
4.3 The practical meaning of probability
    Exercises
4.4 Independent events
4.5 Bernoulli trials. The binomial distribution
    Example 4.5.1
    Example 4.5.2
    Exercises
4.6 [title illegible]
4.7 The standard normal distribution
    Exercises
4.8 Descriptive measures in probability distributions
    Example 4.8.1
    Example 4.8.2
    Example 4.8.3
    Example 4.8.4
    Example 4.8.5
    Example 4.8.6
    Example 4.8.7
    Example 4.8.8
    Example 4.8.9
    Example 4.8.10
    Exercises
4.9 Normal approximation to the binomial distribution
    Example 4.9.1
    Example 4.9.1a
    Example 4.9.2
    Example 4.9.3
    Exercises


5—EDUCATED GUESSING

5.1 The use of a random sample
5.2 Definition of a random sample
5.3 Drawing a random sample from a finite population
    Exercises
5.4 The probability distribution of a sample mean
5.5 A confidence interval for μ when σ is known
    Example 5.5.1
    Example 5.5.2
5.6 Required sample size
    Example 5.6.1
    Example 5.6.2
    Exercises
5.7 A confidence interval for μ when σ is unknown
    Example 5.7.1
    Example 5.7.2
    Example 5.7.3
    Exercises
5.8 A confidence interval for the difference between two means,
    μ₁ − μ₂, when σ₁ and σ₂ are known
    Example 5.8.1
    Example 5.8.2
    Example 5.8.3
    Example 5.8.4
5.9 A confidence interval for the difference between two means,
    μ₁ − μ₂, when σ₁ and σ₂ are unknown
    Example 5.9.1
    Example 5.9.2
    Exercises
5.10 A confidence interval for the binomial proportion p
    Example 5.10.1
    Example 5.10.2
5.11 Required sample size for estimating a proportion p
    Example 5.11.1
5.12 A confidence interval for the difference between two
    population proportions, p₁ − p₂
    Example 5.12.1
    Exercises


6—TO REJECT OR NOT TO REJECT

6.1 The role of statistics in the scientific method
6.2 The level of significance
6.3 The critical region
6.4 Performing the test
6.5 The descriptive level of significance
6.6 One-tailed and two-tailed tests
    Exercises
6.7 Tests concerning μ when σ is unknown
    Example 6.7.1
6.8 Relation between testing and estimating
    Example 6.8.1
    Example 6.8.2
    [entry illegible]
    Exercises
6.9 Tests concerning the difference between two population means
    Example 6.9.1
    Exercises
6.10 Tests concerning the binomial proportion p
    Example 6.10.1
    Example 6.10.2
6.11 Tests concerning the difference between two population proportions
    Example 6.11.1
    Exercises


7—SORTING OUT THE CATEGORIES

7.1 Introduction
7.2 A binomial problem
7.3 1×2 tables
7.4 1×c table
7.5 2×2 contingency table
7.6 The r×c contingency table
7.7 Other useful χ² tests
    A. Test of homogeneity (2×2 case)
    B. Test of homogeneity (r×c case)
       Example 7.7.1
    C. Test of shift in binomial proportion
    Exercises


8—PREDICTING WITH CONFIDENCE

8.1 Introduction
8.2 An example
8.3 Fitting an equation to the data
    Exercises
8.4 Test of hypothesis about β, the slope of the population regression line
8.5 Interval estimate for β
    Exercises
8.6 Predicting the average response at a given value of x, say x_k
8.7 Predicting the next observation at a given value of x, say x_k
8.8 Analysis of residuals
    Exercises
    Summary exercises


A Backward Glance

Appendix A: Tables

Appendix B: Numerical Answers

Index



ALL RIGHT, 

LET'S GET 
TOGETHER 
OUT THERE! 


I THINK MAYBE, PERHAPS,
HOPEFULLY, IF EVERYTHING
GOES RIGHT AND NOTHING 
UNPREDICTABLE HAPPENS, 
POSSIBLY I GOT IT! 


LET'S START CALLING FOR 
THOSE FLY 


THAT ISN'T EXACTLY 
WHAT I MEANT! 


1—DATA FOR STATISTICS


1.1 INTRODUCTION 


All of us are constantly annoyed by the confusion and misunderstandings that
occur in verbal and written communication. It is then reasonable to ask, “How
can things be more clearly stated so that such conflicts can be avoided?” A
scientist such as a chemist, mathematician, or engineer attempts to resolve
these problems by becoming very quantitative. For instance, a chemist will
measure the amount of liquid in a vessel to the nearest tenth of a cubic
centimeter and state, “There are 70.6 cubic centimeters of liquid in this
vessel.” Such a statement is very clear and very seldom leads to any
misunderstanding.

However, not every quantitative statement is this clear. Consider the
following statements:

1. John Jones's salary is one half of Joe Emery's.

2. Mary is twice as smart as Sarah.

3. This is the fifth largest snowfall in Schodack's history.

4. In one study, Crest toothpaste reduced cavities 42 percent when compared
   to a control.


Each statement has some quantitative aspect that, by itself, is
understandable; that is, we know the meaning of “one half,” “twice,” “fifth,”
“42 percent,” and so on, but there is still much opportunity for confusion and
misinterpretation.

For example, what is really meant by statement number 2? Is there any valid
way that such a judgment could be made? Are we talking about grades in school,
results on achievement tests, or just some overall subjective judgment? In any
case the word “twice” conveys something definite to the listener. He knows that
2 × 4 = 8, 2 × 10 = 20, and so on. However, the use of the very specific
quantitative word “twice” can still be subject to a great deal of controversy.


Statement number 3 might convey something important to a weatherman in 
Schodack, but it doesn't convey much to anyone else. Everyone knows that 
there have been four snowfalls larger than this one, but there is no way for us 
to understand anything more than just that. Fifth relative to what? How large 
was the snowfall? 


To make statements less quantitative doesn't help much either. For example, 
statements like “this is the worst smog this year" or “the policeman gave 
Eunice a ticket for speeding" conjure up various pictures in people's minds. 
Different people interpret things differently, and the more subjective or per- 
sonal interpretation possible, the more confusing things become. 


The subject “statistics” is an attempt to bring some understanding to the use 
of quantitative measures in any kind of statement one might like to make. We 
will find that the use of statistical quantities will be severely restricted by the 
kind of data one has, by the way in which we will draw conclusions and make 
predictions, and finally, by the people to whom statistical reports are submit- 
ted. Being thus restricted, and carefully defined, statistical quantities have a 
chance of keeping quantitative statements clear of confusion and misinterpreta- 
tion. 


In this book we hope to give you the beginning ideas of what statistics is all 
about. And the beginning of that beginning can well be your consideration of a 
number of illustrative examples of quantitative statements that leave something 
to be desired. For that purpose the following exercises have been compiled. 




EXERCISES 


For each of the statements below, do the following: (a) underline the numerical 
quantities, (b) determine what additional information is required to enable you 
to discuss the statement, (c) rank the statements A-F in order of clarity, 1 being 
the most clear and 6 being the most unclear, and (d) discuss the usefulness of 
each statement. 


A. “Love-ins that extend for more than one night become a deadly bore, even at
   age 17. I think 28 minutes is roughly all the time you need for a love-in.”
   (Peter Drucker, Our Top-Heavy Corporations, Dun's, April 1971.)

B. “Analyses show that 18% of staff employees are minorities and 63% women.”
   [Powers, Mary F., End discrimination to hold onto Federal funding, Coll.
   Manage., May 1971 (article on Univ. Pittsburgh).]

C. “In transportation, while U.S. road vehicles continue to increase in numbers
   twice as fast as the human population, the creation of new mass
   transportation systems to relieve our choking roads lags far behind—though
   not for lack of abundant technology.” (Lessing, Lawrence, The senseless war
   on science, Fortune, March 1971, p. 154.)

D. “Surveys indicate that as many as 40% of engineers would choose a different
   profession if they had a chance to start again.” (Gooding, Judson, The
   engineers are redesigning their own profession, Fortune, June 1971, p. 72.)

E. “Approximately 43% of the women and 30% of the men failed to recognize
   feelings of love until after 20 or more dates . . . .” [Brothers, J., Ask
   Dr. Joyce Brothers, Durham Sun, Friday, July 3, 1970, Durham N.C. (quotation
   from a Midwestern Univ. Survey).]

F. “A study of 10,000 workers in the Chicago area has found that 3000 of them
   can be considered high risk candidates for heart attacks....” (N.Y. Times,
   March 22, 1970.)

Collect three examples of interesting but confusing statements from any of the 
media such as newspapers, magazines, or television. Explain why the statements 
are confusing or misleading. 


1.2 DATA ON SOME MECCA COMMUNITY COLLEGE STUDENTS 


Discussion of the difficulties associated with the use of quantities, numbers, 
percentages, and so on in Section 1.1 leads us naturally to consider the basic 
characteristics of data. Is there a structure to data, and, if so, will this structure 
help us in making more valid use of numerical or other quantifiable informa- 
tion? 

In order to help us understand the basic structure of data, we shall use the 
results of surveying 180 upper-class students in a large community-college 
system to indicate the kinds of data we shall be considering in this text. The 
data obtained from each student were coded and recorded below. 


TABLE 1.2.1 Survey Results on 180 Nonfreshman Community
College Students (Mecca Community College)

              Commuting Distance
              from Home to            Political
Student       College (Miles to       Party          Marijuana    Freshman
Number   Sex  Nearest One Half Mile)  Preference(a)  Question(b)  G.P.A.(c)
1        M    24                      R              3            2.42
2        M    15                      D              1            1.75
3        F    9                       N              4            2.06
4        M    15                      R              1            3.00
5        M    25.5                    O              3            2.22
6        M    9.5                     R              4            2.71
7        M    11.5                    N              5            3.26
8        M    5.5                     R              1            2.90
9        F    5                       R              1            2.00
10       M    32                      D              5            3.59
11       F    24                      N              5            2.46
12       M    26.5                    D              4            2.33
13       M    5.5                     D              5            3.15
14       M    35                      R              1            2.47
15       M    5                       D              1            3.11
16       M    6.5                     N              4            0.81
17       M    7                       р              1            1.13
18       M    22.5                    D              4            2.60
19       M    32                      D              4            2.84
20       F    6                       R              4            2.70
21       M    10                      N              4            2.26
22       M    3.5                     R              3            1.35
23       F    7                       N              5            2.12
24       M    4                       D              4            1.54
25       M    3.5                     R              2            1.61
26       M    28.5                    R              1            2.15
27       M    10                      D              1            3.18
28       M    32                      N              1            2.66
29       M    75                      D              4            1.34
30       M    29.5                    N              4            2.57
31       M    4                       R              4            1.71
32       F    15                      D              4            2.41
33       F    3.5                     D              1            3.03
34       M    4                       N              4            2.50
35       F    27.5                    D              5            2.91
36       M    2                       N              1            1.20
37       F    4.5                     N              1            2.23
38       M    36.5                    D              4            3.09
39       M    4                       O              2            1.66
40       F    К)                      R              2            2.67
41       F    16.5                    R              4            2.48
42       F    15                      D              4            3.06
43       M    56.5                    D              2            3.31
44       F    27.5                    N              5            2.39
45       M    25                      R              2            2.71
46       F    13.5                    R              1            2.74
47       M    13.5                    R              2            2.85
48       F    20                      N              4            2.69
49       M    56.5                    D              4            3.37
50       M    2                       N              1            2.63
51       M    12.5                    R              2            2.09
52       F    0.5                     N              5            2.71
53       F    3.5                     N              1            2.68
54       M    3.5                     N              4            2.68
55       F    7                       р              1            2.00
56       M    11                      N              4            2.89
57       M    0.5                     N              4            1.94
58       F    Б.Б                     D              3            2.59
59       F    12.5                    R              4            2.92
60       F    26.5                    D              4            2.08
61       F    14.5                    R              1            2.62
62       F    0.5                     R              4            2.18
63       F    28.5                    D              4            2.35
64       M    7.5                     N              1            1.06
65       M    11.5                    N              5            2.57
66       F    20.5                    р              1            2.23
67       M    1                       N              4            1.91
68       F    25.5                    D              4            2.98
69       F    13                      R              2            2.41
70       F    17.5                    D              5            2.03
71       F    17                      N              4            2.47
72       M    11.5                    N              4            2.12
73       M    3.5                     R              1            2.38
74       M    7.5                     D              5            2.40
75       F    2                       N              5            2.95
76       M    1                       N              4            2.81
77       M    3                       N              4            2.78
78       F    8.5                     D              4            2.44
79       F    19.5                    D              4            2.53
80       M    6.5                     D              4            2.50
81       F    2.5                     R              1            1.64
82       M    3                       N              5            2.25
83       M    1.5                     N              4            2.28
84       M    29.5                    R              3            2.14
85       M    15.5                    N              5            3.10
86       F    13.5                    R              1            3.36
87       M    12                      D              1            2.90
88       M    3                       N              4            1.46
89       M    4                       D              5            3.36
90       F    23.5                    N              1            2.34
91       M    28.5                    N              4            2.12
92       M    3                       O              1            3.05
93       M    20                      R              4            2.19
94       M    17                      N              4            3.00
95       M    0.5                     R              4            2.46
96       M    3.5                     N              3            3.22
97       F    6                       D              4            3.16
98       M    19.5                    O              4            2.04
99       M    1                       R              5            2.21
100      F    1                       R              1            3.21
101      M    35                      D              4            1.83
102      M    1                       D              5            4.00
103      M    13                      R              4            1.56
104      F    13                      N              4            2.39
105      F    10.5                    R              4            2.40
106      F    24                      D              1            2.61
107      M    22.5                    D              4            2.28
108      M    24                      D              5            3.17
109      M    15.5                    D              2            2.73
110      M    6                       N              4            3.09
111      M    19.5                    D              4            2.78
112      M    3                       D              4            1.67
113      F    1                       р              1            2.35
114      M    1                       N              4            1.98
115      F    8                       D              1            1.82
116      M    7                       D              4            1.72
117      M    21.5                    N              5            2.85
118      M    8                       N              4            2.12
119      M    3                       N              4            1.71
120      F    2.5                     N              1            3.19
121      F    6.5                     N              1            3.86
122      M    5                       D              1            2.51
123      M    46.5                    N              5            2.08
124      M    2                       R              4            1.88
125      F    22.5                    D              1            2.13
126      F    12.5                    N              5            1.94
127      F    17                      D              1            1.75
128      F    6                       R              2            1.53
129      M    25.5                    N              5            1.67
130      M    5                       R              4            3.55
131      F    19                      R              4            2.16
132      M    6                       R              4            2.37
133      F    5                       N              4            2.42
134      F    36.5                    N              5            2.44
135      F    9.5                     R              4            2.02
136      M    10.5                    R              4            2.92
137      M    36.5                    D              5            2.37
138      M    15                      R              1            2.72
139      F    3.5                     R              1            3.09
140      F    9                       R              1            3.09
141      M    3.5                     O              4            2.61
142      F    12                      N              5            2.26
143      F    8                       R              1            2.15
144      F    21.5                    O              4            2.08
145      M    7.5                     N              d            2.59
146      M    4.5                     R              ~            2.40
147      F    23.5                    D              4            1.80
148      M    12                      N              5            2.25
149      M    7.5                     N              1            1.93
150      M    v                       R              1            1.27
151      F    1.5                     р              4            3.40
152      M    25                      R              4            2.70
153      M    4                       D              2            1.91
154      F    6                       D              3            2.13
155      F    5                       N              1            3.29
156      F    7                       N              4            2.91
157      M    10                      N              2            2.15
158      F    4                       D              4            2.17
159      M    29.5                    R              4            2.79
160      F    28.5                    р              1            2.36
161      F    27.5                    N              4            2.60
162      F    4.5                     N              4            2.00
163      F    12                      D              1            2.50
164      F    0.5                     R              4            2.10
165      F    12.5                    N              5            2.66
166      F    4                       D              1            1.79
167      F    4                       D              4            1.86
168      M    6                       D              1            2.18
169      M    8                       R              4            2.63
170      F    7                       N              5            1.51
171      F    9.5                     R              4            1.85
172      M    20.5                    D              5            1.00
173      M    20.5                    N              5            1.97
174      F    3                       N              1            2.60
175      M    2.5                     N              5            2.66
176      F    10.5                    N              2            21
177      F    0.5                     N              2            2.05
178      M    1                       D              4            1.61
179      M    12                      D              1            2.28
180      M    4                       R              1            2.07

(a) Political party preference:
      R = Republican
      D = Democrat
      O = Other party
      N = No party preference
(b) Marijuana Question: Statement: “Marijuana should be legalized”
      Opinion Scale
      1          2          3          4          5
      Strongly   Mildly     No         Mildly     Strongly
      Disagree   Disagree   Opinion    Agree      Agree
(c) G.P.A. = Gradepoint average at end of freshman year (range 0-4).


We shall refer to these data as observations taken on each of five survey 
questions from 180 community college students. Since the total number of 
upper-class students (in the system) was around 10,000, these 180 students 
represent a small subset of the total. We shall call this subset a sample of the 
upper-class students and refer to the number of students in the subset as the 
sample size. We denote the sample size with the letter n. Thus we say that the 
sample size here is n = 180. 
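
As a small numerical aside, the following sketch computes the sample size and the fraction of the population that the sample represents. The population figure of 10,000 is only the approximate value quoted above, and the variable names are our own illustrative choices.

```python
# Sample size and sampling fraction for the Mecca Community College survey.
# The population size is the approximate figure ("around 10,000") from the text.
population_size = 10_000  # approximate number of upper-class students
n = 180                   # sample size: number of students surveyed

sampling_fraction = n / population_size
print(f"n = {n}, sampling fraction = {sampling_fraction:.1%}")
```

On these figures the sample covers a little under 2 percent of the upper-class students.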


In case you are thinking about visiting Mecca Community College, we have
to confess to you that it exists only within the pages of this book. It is a
composite of facts and features of many institutions, real and imagined, put
together to represent lots of places without running any danger of favoring or
offending anyone.


The data in the table are of the various kinds that we meet most often in 
quantitative investigations. There are purely categorical data: male, female; 
Democrat, Republican, other party, or no party preference. There are data that 
are categorical but subject to ranking: strongly disagree, mildly disagree, no 
opinion, mildly agree, or strongly agree. And there are the numerical data with 
which we are most familiar in measurement terms: the distance between home
and college, the student's gradepoint average (G.P.A.). 


It would be useful for the reader to think a bit about what might come out of
this set of data, sensible or otherwise. The following exercises suggest some
lines of thought.


EXERCISES 


1.2.1 Using the data on the 180 upper-class students at Mecca College, write five
statements that you believe to be quantitatively clear. 


1.2.2 Write five statements using numbers, percentages, and so on from these data 
that you would consider controversial. 


1.2.3 What aspects of the data need clarification? 


1.2.4 If you were going to gather data (similar to these) at your school, what changes 
in topics, categories, or procedures would you make? 


1.3 DATA CLASSIFICATION 


Having discussed the confusion and misunderstanding that occur when one 
interprets numbers and having recognized the difficulties that arise even when 
data are clearly structured as in the Mecca Community College data, it is nice 
to know that there really is a structure to data. This structure will be very 
useful in helping us to understand how data can be used effectively. 


For the purposes of this text, we shall identify observations according to the 
following classification scheme: (a) discrete (nominal; ordinal) and (b) 
continuous (interval; ratio). 


A. Discrete Data 


When data are classified into categories such that any observation can fall 
into one and only one category, the data are said to be discrete. For example, in 
Table 1.2.1, sex, party preference, and the opinion on marijuana are examples 
of "discrete" data. However, discrete data can again be separated into two 
distinct groups: one group which is strictly qualitative in character and is called 
nominal, and another group, still qualitative, but which also has an inherent 
ordering characteristic and is called ordinal. 


Nominal Observations. The first and simplest form of measurement one 
can use on experimental results is to classify them by some nominal attribute, 
for example, "this is a male," or "this student is a Democrat." The chief 
defining characteristics in nominal observations are their qualitative nature and 
the equality of status within each nominal classification. Thus decisions on the 
political preference of the 180 students in the sample make no allowance for 
different strengths of preference among those stating a preference for Demo- 
crats. Likewise if one categorized color TV sets into two mutually exclusive 
categories, acceptable or nonacceptable, all TV sets classified as acceptable 
would have equal status within the classification even though some may be 
merely satisfactory and others excellent. Observations of this type are classified
in a very unsophisticated manner, and treatment of the data thus obtained is
limited to using the total numbers of observations in the various categories. 
However, obtaining this kind of data requires very little effort and is usually 
inexpensive and quick. Also there are times when the counts in various 
categories are exactly what we are looking for. 
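
To make the point about counts concrete, here is a minimal sketch in Python that tallies the party preference codes of students 1 through 10 as read from Table 1.2.1. The list of ten codes and the variable names are our own illustrative choices; the tally is the only numerical summary that nominal data of this kind support.

```python
from collections import Counter

# Party preference codes of students 1-10 as read from Table 1.2.1
# (R = Republican, D = Democrat, O = other party, N = no party preference).
party = ["R", "D", "N", "R", "O", "R", "N", "R", "R", "D"]

# Counting observations per category is the basic treatment of nominal data.
counts = Counter(party)
for code in ("R", "D", "O", "N"):
    print(code, counts[code])
```

Note that nothing in the tally distinguishes a strong Republican from a lukewarm one; every observation in a category has equal status.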


Ordinal Observations. The next step in classifying observations in a more 
quantitative way is to use an ordinal classification scheme. For example, bar 
soaps might be graded excellent, good, satisfactory, poor, and bad. While these 
are nominal-type classifications, they have an inherent ordering among them; 
that is, an excellent bar is in some qualitative way a better bar than one that 
falls in the good category. Likewise the opinion scale for considering the 
legalization of marijuana is inherently ordered. It is easy to see that this type of 
data can be of more use to an experimenter than can nominal observations. 
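
The extra information in ordinal data can be sketched in a few lines of Python. The grade names are those of the bar-soap example; the list of graded bars and the helper name is_better are hypothetical, introduced only for illustration.

```python
# Soap grades from worst to best. The position in this list is the only
# numerical information that an ordinal scale carries.
GRADES = ["bad", "poor", "satisfactory", "good", "excellent"]
RANK = {grade: position for position, grade in enumerate(GRADES)}

def is_better(a: str, b: str) -> bool:
    """Return True if grade a ranks above grade b."""
    return RANK[a] > RANK[b]

bars = ["good", "bad", "excellent", "satisfactory"]  # hypothetical gradings
ranked = sorted(bars, key=RANK.get)                  # worst to best
print(ranked)
print(is_better("excellent", "good"))
```

The ranks allow sorting and comparison, but the sketch deliberately avoids subtracting one rank from another, since the scale says nothing about how much better one grade is than the next.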

Another example of this kind of data would be the following data on the 
classification of coal miners according to the seriousness of a lung disease called 
pneumoconiosis. 


Degree of Pneumoconiosis Among 
100 Coal Miners 


While these data reflect an ordering of severity, no decision can be made 
about the magnitude of the difference between a minor case and a major case, 
or between a major case and a serious case. All we know is that a serious case 
is somehow worse than a major case and that a major case is qualitatively 
worse than a minor case. While such inherent ordering assists in making 
decisions or judgments somewhat more precise, people are just not content to 
make identifications, comparisons, and judgments using simple qualitative data. 
They want and continually strive to be more specific or precise in every 
judgment. The next step is to consider distance between categories as being 
more informative than what is shown with just category identification. Thus 
distance between categories should have a numerically meaningful value. 

Only one kind of ordinal data has such distance between categories natur- 
ally. That kind is counting data. Toss six coins in the air, let them fall to rest, 
and count the number of heads. That number has to be either 0, 1, 2, 3, 4, 5,
or 6. Our observations in repetitions of such tosses will thus fall into the 
categories 0, 1, 2, 3, 4, 5, and 6, and those categories have all the numerical 
rights of the numbers involved: 5 is 2 more than 3, 4 is twice as large as 2, and 
so on. 
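
The six-coin experiment can be checked by brute force. The sketch below, with variable names of our own choosing, lists every possible result of tossing six coins and confirms that the number of heads always falls into one of the seven categories 0 through 6.

```python
from itertools import product

# All 2**6 = 64 possible results of tossing six coins (H = head, T = tail).
outcomes = list(product("HT", repeat=6))

# Each result falls into exactly one category: its number of heads.
head_counts = [outcome.count("H") for outcome in outcomes]
categories = sorted(set(head_counts))
print(len(outcomes), categories)
```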




Such counting categories can go on and on. Set up an experiment in which 
you toss one coin repeatedly until a head appears. Record the number of tosses 
required. Well, that number can be either 1 (head on the first toss can occur), 
2, 3, 4, or on and on. In theory, you may keep getting a tail virtually forever. 
Thus in observations on such an experiment, the categories of those
observations are 1, 2, 3, 4, 5, and so on. Mathematicians write 1, 2, 3, 4, 5, ...
and mean by those three dots “on and on in this manner without end.” We say the
number of categories is countably infinite. 
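
A simulated version of this experiment is sketched below. It assumes a fair coin and uses Python's pseudo-random number generator in place of real tosses; the seed and the function name tosses_until_head are arbitrary choices made for illustration.

```python
import random

def tosses_until_head(rng: random.Random) -> int:
    """Toss a fair coin until a head appears; return the number of tosses."""
    tosses = 1
    while rng.random() >= 0.5:  # treat a value of 0.5 or more as a tail
        tosses += 1
    return tosses

rng = random.Random(1976)       # arbitrary seed, for repeatability only
results = [tosses_until_head(rng) for _ in range(1000)]
print(min(results), max(results))
```

Every recorded value is a whole number of at least 1, and no upper limit is built into the function, which is what the open-ended list 1, 2, 3, ... expresses.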

Categories of count do not have to march 0, 1, 2, 3, 4, and so on. Pairs of 
socks get counted 1, 2, 3, and onward but the corresponding socks get counted 
2, 4, 6, and onward. Integers are 1, 2, 3, 4, ... but squares of integers are 1, 4,
9, 16, .... The distinctive feature of count data, as with all ordinal data, is the
discreteness of the various possible categories. 

Ordinal data that are not counting data are often coded numerically for 
convenience to computers. We can code male as 1, female as 2 (or vice versa!). 
We can designate 1 as Republican, 2 as Democrat, 3 as Other Party, and 4 as 
no party preference. We have coded opinion on legalizing marijuana, taking 1, 
2, 3, 4, and 5 in order along an opinion scale ranging between strongly disagree 
and strongly agree. 

A hazard in using such coded identification of categories is that the codes 
will be interpreted as count data, with distance mistakenly assigned to the 
spacings between categories. For example, in the scale (or coding) used for the 
opinion on legalizing marijuana, the following implication might be made: 
With a score of 1 for strongly disagree and a score of 5 for strongly agree, then 
strongly agree is five times the score of strongly disagree. Is that what the coder 
or experimenter wanted to imply? It also implies that you can plot the data in a 
scale having equal spacings between categories; that is, 


Perhaps this is not meant either. We need to be carefully on guard as to how 
different interpretations can be made using the same data when arbitrary 
scaling or coding is used. All of this leads us to search enthusiastically for data
that do allow us to measure distances and make neat numerical comparisons. 
And so we go to our next classification. 
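
The hazard just described can be seen in a short sketch. The first coding is the 1-to-5 scale used for the marijuana question; the second, running from 11 to 15, is a hypothetical alternative of our own that keeps exactly the same ordering. The "ratio" between the two extreme opinions changes with the coding, which shows that it is a product of the code and not of the opinions.

```python
# Two codings of the opinion scale that preserve the same ordering.
# coding_b is hypothetical, chosen only to make the point.
OPINIONS = ["strongly disagree", "mildly disagree", "no opinion",
            "mildly agree", "strongly agree"]
coding_a = {opinion: code for code, opinion in enumerate(OPINIONS, start=1)}
coding_b = {opinion: code for code, opinion in enumerate(OPINIONS, start=11)}

def same_order(c1: dict, c2: dict) -> bool:
    """Return True if both codings rank the categories identically."""
    return sorted(c1, key=c1.get) == sorted(c2, key=c2.get)

ratio_a = coding_a["strongly agree"] / coding_a["strongly disagree"]
ratio_b = coding_b["strongly agree"] / coding_b["strongly disagree"]
print(same_order(coding_a, coding_b), ratio_a, round(ratio_b, 2))
```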


B. Continuous Data 


When the data are obtained using a continuous or noninterrupted scale of 
measurement such that numerically equal differences stand for empirically 
equal differences, we say that the data are continuous. There are two different 
types of scale for such data. 




Interval-scale Data. If the scale has an arbitrary zero point, then the data 
are obtained using an interval scale. Typical examples of interval scale data are 
those obtained using the centigrade or Fahrenheit temperature scale, using the 
year designations such as 1975 and 1976, or using indices in which the zero 
point is determined by an arbitrary definition, as for I.Q., health status, and the 
like. 


The following table indicates a typical ordered qualitative classification of 
data and also the use of an interval scale. [Pratt, Lois, The relationship of 
socioeconomic status to health, Am. J. Public Health 61 (2), 281-291 (1971), 
Table 1]: 

TABLE 1.3.1 Quality of Health Maintenance Practices in Relation to
            Level of Health and Extent of Health Problems*

                               Average Scores on Indexes of
Quality of Health-             Level of       Extent of
Maintenance Practices          Health         Health Problems

Poor                           2.3            33.7
Medium                         2.8            27.9
Good                           3.1            24.2
Total Group                    2.8            27.8


Consider the index on the status of health. Does a zero on this scale mean 
“no” health at all? Certainly not. The zero is arbitrarily defined. It has no real 
health meaning. Does the index score 3 mean “twice as healthy” as index score
1.5? Surely not; we cannot make health sense of such ratios. While such 
restrictions place very little constraint on the numerical analysis of these data, 
there are some difficulties in interpretation associated with interval-scale data. 


In the data on the n = 180 upper-class students, the observations on G.P.A. 
are measured on an interval scale. We are all aware that a 2.00 or C in one 
class is not the same as a C in another class. Further, no one would agree to 
statements like, “John's G.P.A. is 3.00 and Jim's is 2.00; therefore, John is
three halves or one and one half times as 'smart' as Jim.” Thus any ratio
calculated using an interval scale is suspect. 
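
A short Python sketch shows why ratios on an interval scale are suspect, using the two temperature scales mentioned above. The three readings (10, 20, and 40 degrees centigrade) are arbitrary values chosen for illustration.

```python
def centigrade_to_fahrenheit(celsius):
    """Convert a centigrade reading to Fahrenheit."""
    return celsius * 9 / 5 + 32


# Three arbitrary readings, in degrees centigrade (illustrative values only).
a_c, b_c, c_c = 10.0, 20.0, 40.0
a_f, b_f, c_f = (centigrade_to_fahrenheit(t) for t in (a_c, b_c, c_c))

# A ratio of readings depends on where the arbitrary zero sits.
ratio_c = b_c / a_c  # 20 / 10
ratio_f = b_f / a_f  # 68 / 50

# A ratio of differences does not depend on the zero point.
diff_ratio_c = (c_c - b_c) / (b_c - a_c)  # 20 / 10
diff_ratio_f = (c_f - b_f) / (b_f - a_f)  # 36 / 18

print(ratio_c, ratio_f)            # 2.0 1.36
print(diff_ratio_c, diff_ratio_f)  # 2.0 2.0
```

The second reading is "twice" the first on one scale and only 1.36 times the first on the other, although the same two temperatures are being described. Comparisons of differences survive the change of scale, which is why numerically equal differences can stand for empirically equal differences on an interval scale.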


*Copyright © 1971 by the American Public Health Association, Inc. Reprinted by permission of the
author and publisher.


Another example is the “rub-for-suds” test for soap bars. In this test, the bar
soap is handled in such a way that the number of standard rubs of a towel
against the soap while wet is the inverse measure of the sudsing quality of the
soap. The towel is rubbed against the bar in such a way that the towel is carried
down to water in a pan beneath the bar at the end of every rub, and the soap is
rinsed out before the next rub. The test is mechanized, and the rubbing of the
soap bar is continued until enough soap is in solution to form true suds which
will persist for 15 seconds after agitation. It is not necessary that the suds reach
a particular height or that they cover the surface of the water, but even so the
number of rubs-for-suds is an inverse measure of the sudsing quality of the
soap. The results are reported as the number of rubs required for a given
amount of suds; normally this turns out to be about 100 rubs, and it is said that
a bar requiring 200 rubs is only half as good as the first, standard bar.
Conversely a bar requiring only 50 rubs to give a persistent suds is said to be
twice as good as the standard bar, but is it? There is no absolute nature to the
observations, since they are referred to an artificial zero standard. The zero
thus assigned is not the lower limit at which the property of sudsing vanishes,
and consequently the results are truly definable as lying on an interval scale.


Ratio-scale Data. Ratio scales are interval scales with the distinction that
they have an absolute zero. Thus measurements like height, weight, amount of 
income, and level of iron in the blood stream are examples of ratio-scale data. 


In one article is given the distribution by 1967 per capita personal income of 
134 counties in Texas with a commodity distribution or food-stamp program in 
September 1968. ([Lukaczar, Moses, Lessons for the Federal effort against 
hunger and malnutrition from a case study, Am. J. Public Health, 61 (2), 
259-276 (1971)]; data modified for purposes of this illustration.) Income 
values like 1999 stand for 1999.999..., closing up the continuous scale 
needed to accommodate every possible per capita income value. 


Table 1.3.2 is this example of a set of continuous data on a ratio scale. 
Another example is the data on the commuting distance between college and 
home in Table 1.2.1, where “zero” distance is a real zero value. 


TABLE 1.3.2*

Per Capita Income       Midpoint       Number of Counties

Less than $1500            750                20
$1500-1999                1750                50
2000-2499                 2250                25
2500-2999                 2750                23
3000-3499                 3250                 9
3500-3999                 3750                 6
4000-4499                 4250                 0
4500-4999                 4750                 0
5000-5999                 5500                 1
Total                                        134

*Copyright © 1971 by the American Public Health Association, Inc. Reprinted by permission of the
author and publisher.
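
The convention that 1999 stands for 1999.999... can be checked against the Midpoint column with a small Python sketch. It treats each class as running from its lower limit up to, but not including, the lower limit of the next class. The lower limit of 0 for the open class "Less than $1500" is our assumption, chosen because it reproduces the tabled midpoint of 750.

```python
# Each class as (lower limit, upper limit, number of counties). The upper limit
# is the start of the next class, so "$1500-1999" covers 1500 up to 2000.
# The lower limit 0 for the first class is an assumption (see the text above).
classes = [
    (0, 1500, 20),
    (1500, 2000, 50),
    (2000, 2500, 25),
    (2500, 3000, 23),
    (3000, 3500, 9),
    (3500, 4000, 6),
    (4000, 4500, 0),
    (4500, 5000, 0),
    (5000, 6000, 1),
]

# Midpoint of each class and the total number of counties.
midpoints = [(lower + upper) / 2 for lower, upper, _ in classes]
total_counties = sum(count for _, _, count in classes)


def class_of(income):
    """Return the (lower, upper) limits of the class containing an income."""
    for lower, upper, _ in classes:
        if lower <= income < upper:
            return (lower, upper)
    raise ValueError("income lies outside the tabled classes")


print(midpoints)
print(total_counties)     # 134
print(class_of(1999.99))  # (1500, 2000)
print(class_of(2000))     # (2000, 2500)
```

With the classes closed up in this way, every possible income from 0 up to 6000 falls into exactly one class, and the computed midpoints agree with those printed in the table.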




TABLE 1.3.3 Summary: data classification

Type of Observation    Distinguishing Characteristic     Common Examples

I. Discrete            Observations are grouped
                       into distinct classes

   A. Nominal          Distinct classes have no          Patient either has or does
                       predetermined rank or order       not have disease A

   B. Ordinal          Distinct classes have             Patients are classified
                       predetermined rank                qualitatively by severity of
                                                         disease; for example, degree
                                                         of pneumoconiosis

II. Continuous         Observation may assume any
                       value on a continuous scale

   A. Interval         Scale defined in terms of         Patient's temperature was
                       differences between               recorded as 98° Fahrenheit.
                       observations. Zero point          I.Q. measurements
                       is arbitrary

   B. Ratio            Scale differences represent       The percent of population
                       real relationships in the         living on farms, 1970 census.
                       items measured. Zero point        Median family income
                       represents total absence of
                       attribute being measured


1.4 EXERCISES


1.4.1 Classify the following observations as to: (a) continuous or discrete, and (b)
nominal, ordinal, interval-scale, or ratio-scale data.

a. A ball bearing has diameter 3.25 millimeters.
b. Twenty-three students attended history class.
c. A toss of 10 coins resulted in six heads and four tails.
d. Jerry was 13th in the math test.
e. The Smith family was classified as a middle-income group.
f. A man swam 50 yards in 62 seconds.
g. Relative humidity reading of 55 percent.
h. The day's temperature was 0° centigrade.
i. 75 on an economics test.
j. John earns $35 an hour.

1.4.2 List the classification of the data collected on the 180 Mecca College students.


1.5 SUMMARY EXERCISES


For each of the following paragraphs comment on: (a) degree of understanding 
transmitted to you, (b) problems in the paragraph for you, (c) the kind of data used, (d)
any limitations these data cause, and (e) statistical problems you need to consider before 
agreeing or disagreeing with the author. 


1.5.1 A study of workers finds 30% of them heart attack risks, N.Y. Times, 
March 22, 1970.* 


A study of 10,000 workers in the Chicago area has found that 3000 of them can be 
considered high risk candidates for heart attacks. 

The finding came out of a two-year pilot study aimed at identifying and 
reducing the high cost of cardiovascular disease to business and industry. 

The American Heart Association, which released details here, said about 30
per cent of the manufacturing and office workers screened had two or more of the 
risk factors associated with heart attacks. 

“This means," the association said, “that their chances of suffering an attack 
are more than double those of persons with no risk factors.” 

Of those in the high risk bracket, 10 per cent of 1000 persons had three or 
more risk factors, indicating that their risk of suffering a heart attack is as much as 
10 times normal. 

“If the Chicago findings are typical of the entire nation," the association said, 
“the national work force of about 80 million would include about 24 million in 
the high risk bracket, and about 8 million in the very high risk bracket." 


1.5.2 "Boyfriend's attitude disturbing to her," Dr. Joyce Brothers, 
Durham Sun, July 3, 1970. 


Dear Dr. Brothers: My boyfriend and I have been seeing each other constantly 
for three months. When we first started dating, it was he who seemed most 
involved. When he told me that he was in love with me, I still wasn't sure what I 
felt for him. 


But now that I know that I’m in love too it bothers me that he doesn't seem to 
show his feelings very much. Sometimes he seems more interested in playing 
tennis than being with me and every once in a while he forgets to call me when he 
promised because he gets absorbed in a book.—R.S. 


Dear Miss S.: From the time that a couple decides that they are in love to the 
conclusion of their courtship, whether in marriage or the break-up of the 
relationship, there is bound to be a certain amount of testing of each other's depth 
of feeling and commitment. Obviously, declarations of love alone are not enough. 
Behavior must appear to express the love that each feels. 

However, a man and a woman may differ in what behavior they consider to be 
indicative of love. A recent survey of men and women students at a large 
Midwestern university attempted to determine how men and women felt when 
they were in love. 

*©1970 by The New York Times Company. Reprinted by permission.

One finding was that male students tended to recognize and accept feelings of
love earlier in the relationship than did female students. Approximately 40
percent of the males, in comparison with 29 percent of the females, reported 
realizing the existence of love feelings in the beginning stages of the relationship. 
Approximately 43 percent of the women and 30 percent of the men failed to 
recognize feelings of love until after 20 or more dates. 


On the other hand, the survey found that women, once they acknowledged love 
feelings, were more likely than men to idealize and romanticize the relationship. 
Both sexes reported feelings of general well-being, difficulty in concentrat- 
ing, exuberance and giddiness as a result of their being in love. Women, 
however, reported experiencing these feelings with a greater intensity than 
men did. 


The authors of the study hypothesized that this difference in the intensity of 
romantic feelings and the tendency of women to idealize the men they were in 
love with might be due to the greater social emphasis put upon love and marriage 
as significant experiences for women. 


A man is not expected to let his feelings of love completely dominate his life to 
the detriment of other interests and commitments, while romantic preoccupation 
is tolerated, even encouraged in women. 


Your dissatisfaction with your boyfriend's failure to behave in a suitably 
romantic way seems based on idealized standards of what is appropriate behavior 
for a person in love. 


1.5.3 "State says many earn under $100—25% of workers get less than 'lower 
standard,'" Peter Kihss, N.Y. Times, March 22, 1970.*


One of every four full-time workers in private jobs in New York City and State 
was earning less than $100 a week last fall, according to a new State Labor 
Department study. 


This means that these employees, for working 52 weeks, would have got less 
than $5200 a year, including overtime pay. The Federal Bureau of Labor 
Statistics has estimated that a family of four needed $6771, as of last spring, to 
meet costs of a “lower standard" of living in the New York-Northeastern New 
Jersey area. 


The new state study indicated half the workers in industries covered by Federal
and state minimum wage laws earned less than $2.92 an hour in straight-time pay 
in the city last fall. The comparable figure statewide was $2.83. 


The state analysis became available at a time when rising living costs—the 
Consumer Price Index for this area has gone up 7.6 percent since February, 
1969—have spurred increasing labor unrest, including a letter carriers' strike. The 
mailmen's starting pay is $6176 a year—$2.97 an hour. 


The state study indicates that one of every 10 full-time workers in the city was 
earning less than $80 a week last fall. A four-person family on welfare rolls in the 
city receives a basic $208 grant a month plus an average of $100 a month for rent, 
a total working out to $77 a week. 


*©1970 by The New York Times Company. Reprinted by permission.




Governor Rockefeller has been asking the Legislature to increase the state's 
hourly minimum wage. 


The Governor's proposal calls for a state minimum of $1.85 an hour effective
next July 1, compared with a current state and Federal minimum of $1.60, which 
has been in effect since Feb. 1, 1968. On a 40-hour week, the Governor's plan 
would mean a minimum wage of $75 a week, up from $64. 


Earnings as of October. The new study by the State Labor Department's 
Division of Research and Statistics, whose director is Charles A. Pearce, offers 
estimates of employee earnings in private industry as of last October, both for 
workers covered by minimum wage laws and exempt workers. 


For full-time workers in New York City, defined as those working 30 or more 
hours a week, gross weekly earnings were estimated as follows: 


Earnings                  No. Workers     Per Cent

Under $50                       3,333        0.1
$50 and under $60              41,492        1.6
$60 and under $70              90,727        3.5
$70 and under $80             138,253        5.4
$80 and under $90             177,081        6.9
$90 and under $100            192,359        7.5
$100 and under $125           420,741       16.3
$125 and under $150           357,306       13.8
$150 and over               1,159,208       44.9

Total                       2,580,500      100.0


Counting 384,100 part-time workers as well, the city had 2,964,600 employees 
in private industry. Of the over-all total, 985,253, or 33.3 percent, earned under 
$100 a week. 


Of the city’s full-time workers, 643,245, or 24 percent, earned less than $100. 
The median earnings—that is, half the workers had more and half less—were 
estimated as $129.03 over all, full-time workers getting $140.76 and part-timers 
$53.27.


The study included estimates of median straight-time hourly pay for workers in 
each of 87 industries under the minimum wage laws. Within New York City, 
those under $2.50 an hour included: 




                                                  Workers          Hourly
Industry                                       (in Thousands)      Median

Rubber & miscellaneous plastic products mfg         11.2           $2.11
Leather & leather products mfg                      30.6            2.12
Variety stores                                      10.0            1.81
Other general merchandise stores
  (excl. department stores)                         11.3            2.22
Food stores                                         61.9            2.22
Apparel & accessories stores                        55.5            2.19
Drugstores                                           6.3            2.28
Eating and drinking places                         116.2            2.18
Residential buildings                               40.4            2.40
Laundries, dry cleaners                             24.8            2.01
Beauty shops                                        13.2            2.17
Barbershops                                          4.1            2.33
Shoe repair, hat cleaning                            1.0            1.93
Miscel. personal services                            2.1            2.47
Temporary help agencies                             21.4            2.26
Motion picture theaters                              6.0            1.86
Dance halls, studios, schools                        1.0            2.46
Bowling and billiards                                1.7            1.96
Convalescent and rest homes                          9.7            2.47


1.5.4 A Newsweek poll: Mr. Nixon holds up, Newsweek, May 25, 1970, p.30* 

Even after the Cambodian invasion and the killings at Kent State University, 
the "silent majority" appears to be alive and well in Richard Nixon's corner. A 
Newsweek Poll conducted by The Gallup Organization last week suggests that— 
despite the recent intense criticism of the President by college students and 
academic leaders and by liberal politicians and commentators—Mr. Nixon's 
standing with the electorate remains undamaged. The poll indicates that Americans
find Mr. Nixon's conduct of the Presidency "satisfactory" by better than 2 to
1, that 50 percent favor the Cambodian operation and 39 percent oppose it, that a 
strikingly large majority is far more willing to blame student demonstrators than 
National Guardsmen for the deaths of four students at Kent State, and that Vice 
President Spiro Agnew's rhetoric about dissenters still enjoys the approval of a 
silent plurality if not a majority. 


To get swift results, the survey was conducted by telephone on May 13 and 14 
and covered a scientifically selected national sampling of 517 persons.** 
*Copyright © 1975 by Newsweek, Inc. All rights reserved. Reprinted by permission.


** Telephone surveys, it should be noted, contain a slight built-in bias—about two percentage 
points, in this case—in favor of Republicans, since nontelephone households are necessarily 
omitted from the sample and these tend to be low-income and Democratic. 




Although the poll gave the President majority approval of his decision to send 
U.S. troops into Cambodia, the favorable rating was by no means as high as some 
opinion experts have come to expect after dramatic strokes of U.S. military 
power, when Americans have a tendency to rally round the President. Following 
the air raids on North Vietnam that President Johnson ordered in 1965, for 
example, public approval (as measured by Louis Harris) soared to 83 percent. 
And 69 percent (polled by Oliver Quayle) favored the entry of U.S. troops into 
the Dominican Republic. 


Women were far more dovish than men on the Cambodian issue. They opposed 
the President's action, 49 to 37 percent, while men supported it, 63 to 30. Women 
also tended to be distinctly less enthusiastic about the Vice President's speeches 
on dissent: in a near even split (37 to 35 percent), they approved the Veep's line, 
whereas men applauded him by a margin of more than 2 to 1. Young people, too, 
were predictably more skeptical of the Administration than their elders, but even 
in the 21-34 age bracket, 55 percent gave the President a favorable rating and 49 
percent approved of Cambodia. And if youth was by no means arrayed entirely 
on the left, neither were blue-collar workers all to the right: those without a high 
school education came down hard against Mr. Nixon's Cambodian policy. A hefty 
56 percent opposed it, and only 26 percent approved. 


The question on the Kent State killings produced an unusually high number of 
'no opinions,' suggesting that the no-opinion column might harbor some people 
with qualms about the guard's behavior who were reluctant to say so outright. It 
also seems likely that some of those polled were suspending judgment about who 
was most to blame until the conflicting accounts of the shooting could be cleared 
up. But even if all those with no opinion were added to those who pinned major 
responsibility on the National Guard, a surprisingly strong majority of each 
group—by age, sex, education and political party—put the main blame on the 
protesters. 


Nixon as President: How satisfied are you with the way Richard Nixon is
handling his job as President?†
    Very satisfied            30%
    Fairly satisfied          35%
    Not too satisfied         18%
    Not at all satisfied      13%

U.S. Troops in Cambodia: Do you approve or disapprove of President Nixon's
decision to send American troops to Cambodia?
    Approve                   50%
    Disapprove                39%
    No opinion                11%

Who's to Blame at Kent: Who do you think was primarily responsible for the
deaths of four students at Kent State University?
    The National Guard        11%
    Demonstrating students    58%
    No opinion                31%

Agnew's Stand: Do you approve or disapprove of Agnew's stand on dissenters
and student protesters?
    Approve                   46%
    Disapprove                30%
    No opinion                24%

† Undecided not shown




1.5.5 "Lottery not random," Letters to the Editor of The Times, N.Y. Times, 
December 1969.* 
To the Editor: 
Inspection of the draft lottery results clearly shows a systematically increasing 
number of men being drafted as their birthdate falls later in the year. The odds 
against this trend resulting from random selection are over 100,000 to one. 


For example, twice as many men with December birthdates will be drafted, 
compared with those having January birthdates. This can be easily seen by 
plotting the average monthly draft number from January through December. The 
plot gives a nearly linear decrease in average age draft number (increasing draft 
risk) with date of birth. 


It is as if the capsules containing the birthdates were placed in the glass bowl in 
monthly order with January on the bottom and December on the top and then 
mixed or stirred too little for a random mix to be obtained. 


The monthly average draft numbers from January to December are approxi- 
mately: 201, 203, 226, 204, 208, 196, 182, 173, 157, 182, 140 and 122. Note that 
the first six months all have averages above the over-all average of 183.5, and the 
last six months averages are all below the over-all average. 


The coefficient of linear correlation between the order number of the lottery 
drawing and the order of the birthdate from January is -0.222, with a standard
deviation of 0.052. If the drawings were random this coefficient would be very 
near zero. The chance of the coefficient being this far from zero is less than one in 
100,000. 


Men born in November and December with draft numbers below 184 should be 
given a new deal by having their 47 birthdates redrawn from a new lottery which 
would give them order numbers to be multiplied by 366/47 and then interlaced 
with the remaining present numbers. The October numbers show a statistical 
fluctuation toward fairness. 


Without this or a similar remedy these men will be subjected to an unfavorably 
biased treatment in opposition to the intent and spirit of the lottery. 

Fred T. Haddock 

Professor of Astronomy 

University of Michigan 

Ann Arbor, Mich., Dec. 5, 1969 


1.5.6 "18-20 Group Lags on Registering to Vote," Edward C. Burks, N.Y. Times, 
August 13, 1972.** 


The city has 360,000 residents who are eligible to vote for the first time as a
result of the lowered voting age, but they are far more apathetic than their elders 
about being registered. 


According to the Board of Elections, during 1971—the first year that people 
aged 18, 19 and 20 could register—only a third of them did so. 


*©1969 by The New York Times Company. Reprinted by permission.
**©1972 by The New York Times Company. Reprinted by permission. 




On the other hand, nearly 58 percent of all New Yorkers over the age of 21 are 
registered. 


Thus there has been a rather lackadaisical response by youth to the lowering of 
the voting age from 21 to 18, which came into effect with the ratification of a 
Constitutional amendment last year. The response could have profound implica- 
tions for the Democratic party, which has been counting on new youthful voters 
to help supply a margin of victory this fall. 


The new voters were, as expected, overwhelmingly Democratic in their party 
preference. The board's tabulations showed 64 percent of them signing up as 
Democrats, roughly the same percentage as their elders. In addition, 11 percent 
were recorded as Republicans, 7 percent as Liberals and 2.6 percent as Conserva- 
tives. More than 15 percent did not list a party affiliation. 


Analyzing 1970 census figures, the Community Council of Greater New York 
has determined that half of the city's future young voters—those now younger 
than 18—live in poverty areas. 


Each year, approximately 120,000 New Yorkers turn 18. Thus, at any given 
time in recent years the “pool” of young people aged 18, 19 or 20 has been about 
360,000. 


Using projections from the 1970 census figures, The New York Times has 
prepared maps, which show heavy concentrations of people in the new voting-age 
bracket in black and Puerto Rican neighborhoods designated as official poverty 
areas. 


The maps are divided into community planning districts, which generally 
coincide with one or several recognized communities. The district making up 
Bedford-Stuyvesant in Brooklyn, the city's largest black community, has the 
highest number of 18-to-20-year-olds, according to census projections—a total of 
12,193. 


There is no exact count of those aged 18 to 20 living in each of these districts 
today. However, everyone was tabulated by age at the time of the 1970 census, 
and demographers believe that a reasonable estimate can be made of the
present situation by using 1970 totals of young people who at that time were 16, 
17, and 18—and who are now 18, 19 and 20. 

The projections show major clusters of the 18-to-20 group in these other 
districts: Williamsburg, South Brooklyn, Crown Heights and East New York in 
Brooklyn, Morrisania in the Bronx, South Jamaica, Flushing, Hollis, St. Albans, 
Queens Village and nearby areas in Queens. 

Alexander Bassett, administrative manager of the Board of Elections, said that 
board personnel found great apathy in the poverty areas when trying to sign up 
new voters. 

Referring to last summer's major effort, when 25 vans were sent into neighborhoods,
Mr. Bassett said it was an expensive “flop” because the program cost
$250,000 and only some 40,000 people were registered.






Up to last Monday, the total number of registrants all over the City this year 
was slightly less than 160,000. That figure includes those registering for the first 
time as well as those signing up again after moving or allowing their registration 
to lapse. 


Unless there is a considerable spurt in new registrations prior to the final cutoff 
in October (after four days of neighborhood registration Oct. 5, 6, 7 and 10), the 
number of New Yorkers eligible to vote for President will not be very much 
greater than in 1968. 


New registration is offset because about 10 percent of those on the list are 
purged each year, according to Mr. Bassett. 


Generally, a large proportion of the new registrants are those who have just 
reached voting age. 


In 1971, when the Board of Elections kept a rather complete score, it found
that 29 percent—or 127,440 of the 440,000 registrants—were those in the new 
voting-age bracket. 


They were distributed by borough as follows: Manhattan, 14,885; Bronx, 
21,881; Brooklyn, 44,635; Queens, 40,812; and Richmond, 5227. 


This year only two of the boroughs—Queens and Richmond—continued to 
keep a count of the number of young people registering. Queens up to Aug. 7 had 
11,912; and Richmond, 2059. 


At the same time, however, Queens had more than 30,000 and Richmond more 
than 5000 young people becoming 18 this year and able to register. The figures 
show far less than half of the new group of 18-year-olds are registering. 


1.5.7 “Do poor people have more friends?”, Shirley Sloan Fader, Family Weekly
of the Sunday Record, Troy, New York, July 21, 1974.* 


If you enjoy watching “The Waltons,” you're involved with the current
nostalgia fad that shows the Depression years as a time of great human warmth. 
Though most nostalgia about any era is unrealistic, the family-friend warmth idea 
connected with poverty may actually be true. A study of 4500 modern families’ 
leisure habits shows that to this day poorer people regularly out-socialize more 
prosperous families. The poor very frequently drop in on their neighbors, relatives 
and friends for an informal visit of helping out, TV watching or just sitting 
around; and they definitely keep closer overall visiting contact with their relatives 
than does the average prosperous person. People with good incomes drift away 
from spending time with friends and kinfolk toward leisure activities that must be 
bought and paid for—bowling, golf, trips, restaurants, movies, etc. It's true that 
purchased entertainment is often interesting and fun. But the old idea of spending 
time with friends still satisfies a basic human need.


1.5.8 “The dream of many, the realization of few," Barbara Hyland, The 
Stanford Daily, © 1968. 


*Reprinted by permission of the Family Weekly Magazine. 




“The Perfect Crime" was almost committed at Stanford this Fall. In true Robin 
Hood fashion, with the cause of studenthood as their ideal, the heroes tried to 
finagle the grading in Psychology 1 so that no one failed. 


Their racket consisted of taking midterms and quizzes under fictitious names. 
By doing poorly, they could bring down the curve. They even handed in computer 
cards and blue cards so that the fictitious names would be placed on all of the 
computerized class lists. 


"If we get enough people to do it, no one can fail," enthusiastically explained 
one unidentified member of the class. Only fake people would get F's. 


However, they forgot the invincible Registrar's Office, which makes up class 
lists according to the cards each student hands in with his student number. These 
lists are then sent to each department. 


Acting on a hot tip-off, J. Merrill Carlsmith, one of the instructors of Psych 1, 
compared the Registrar’s list with the list of people who had taken tests. Even if
he had not checked over the lists now, he said the discrepancies would definitely 
have been discovered when the final marks were recorded. 


Carlsmith said that the fake grades were included in calculating the mean for 
the midterm and some quizzes. He said, however, that there were “so few, it isn't
having effect at all." Despite the grade conspirators optimistic force of 20 cohorts, 
their force was small. Only three fictitious people took the midterm. They got two 
F's and one D. Five nonexistent people took the recent quiz. One, who took both 
the midterm and the quiz, wrote Carlsmith a note asking if he could switch to 
Pass-Fail grading. 

Since Psych 1 has an enrollment of approximately 300, they barely lowered the 
mean. In the case of the midterm, the true mean should have been about two 
hundredths of a point higher. 


If there had been a significant number taking phony tests, thus altering the 
mean, Carlsmith explained that he would have just “pulled the cards and rerun 
them" through the computer. He remarked that it would be “an awful nuisance"
if it had been done in a class that didn't have computerized grading. 

So students took another loss in their struggle to outfool the authorities. But 
the dream still lives... 


Summarizing Data Graphically


2.1 INTRODUCTION


We have seen that the reporting of information in quantifiable terms can be
misleading. We have begun to understand something about different kinds of
data. One might say that the more we know, the more sophisticated is the
measuring device we use. Thus we can look forward to a kind of progression in
our use of statistical methods of analysis (as shown in the accompanying figure).

[Figure: PROGRESSION OF STATISTICAL ANALYSIS. As the type of data moves from
discrete (nominal, ordinal) to continuous (interval and beyond), the nature of
the applicable statistical methods runs from very simple to most sophisticated,
and the number of statistical tools grows from a few to the most possibilities.]

The goal of this chapter is to present some graphical techniques for reducing a
mass of data down to a form that lets us see the forest as well as the trees.
These techniques include various tables, charts, and line graphs. Some devices
are best for one type of data, others for other types. We will then be led into
Chapter 3, where we can add numerical techniques to the graphical ones.

Before we begin to discuss any techniques, we had better decide just what we
hope to accomplish by any analysis we undertake. The objective of statistical
analysis is to answer very well-defined special questions. Let us use the data
we have collected on the n = 180 upper-class students at the community college
and decide on some specific questions that might be addressed to the data:


1. What proportion of the sample is female?

2. What is the average commuting distance of the students in the sample?

3. What is the longest distance a student commutes? What is the shortest
distance?

4. Can we obtain an overall picture of student commuting distances?

5. Is there any difference between female commuting distance and male
commuting distance?

6. Is there any relationship between a student's G.P.A. and the distance he
commutes?

7. What is the political preference profile for these 180 students?

8. Is there any difference in the political preferences of men and women in
the sample?

9. What is the position of the 180 students on the question of legalizing
marijuana?

10. Is the overall position on legalizing marijuana affected by the sex of
students or by student G.P.A.?

11. What is the G.P.A. profile of these students?

12. For each of the above questions, how can we generalize the results to
the population of all upper-class students in the community-college
system? Under what restrictions would we be willing to make statements
about all the upper-class students?




2.2 TABULAR SUMMARIES 


If we are to answer the questions listed above, we will have to take the 
collection of data given in Table 1.2.1 and tabulate them. We call the original 
observations raw data. They are in original form, assembled but unorganized, 
nutritious but uncooked. As soon as we start to tabulate information, 
these raw data will take on a new form; something of shape will be gained, 
something of detail will be lost in the transition. 


For example, in order to answer the first objective question: “What propor- 
tion of the sample is female?", we will have to count up the number of females 
in the sample. Let us record this information in Table 2.2.1. 


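TABLE 2.2.1 180 Community College Upper-class Students, by Sex,
Mecca Community College, 1974

[Table: one row for each sex and a column headed "Frequency or Number." The
printed table is labeled to point out its TITLE, column HEADINGS, row
HEADINGS, a CELL, and its SOURCE.]

Source: Table 1.2.1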


Table 2.2.1 illustrates the following characteristics of a good table: 


1. Simplicity. The simpler the table, the better. A good rule to use
is "Only the specific data needed to answer one particular question
should be in any one table."

2. Title. Every table should have a complete title. This title should identify 
the table's contents by what, where, and when. 

3. Headings. Every row and every column of the table should be labeled 
with a short, clear, and concise heading. If extensive detail of identifica- 
tion is required, use brief label plus footnote. 

4. Cells. Every table consists of a number of subunits called "cells." A cell is
the intersection of one row and one column. The table thus is constructed 
in such a way that any observation (piece of raw data) can be placed in 
one and only one cell of the table. 

5. Source. If the data are being summarized from another source, the 
complete source must be put as a footnote to the table. 




The following table was constructed for the tabulation (summary) of nominal 
data. Ordinal data can be summarized in similar fashion. 


TABLE 2.2.2 Full-Time Enrollment of Persons 14-34 Years
Old in Two-Year Colleges, by Population
Density of Place of Residence, United
States, October 1972
(Numbers in Thousands; Civilian Noninstitutional
Population)


Residence Density                            Male   Female   Total
Metropolitan areas inside central cities      219      149     368
Metropolitan areas outside central cities     356      208     564
Nonmetropolitan areas                         195      127     322
Total                                         770      484    1254


Source: Undergraduate enrollment in 2-year and 4-year colleges: October 1972, 
U.S. Bureau of the Census, Current Population Reports, Series P-20, No. 257, U.S. 
Govt. Printing Office, Washington, D.C. (1973), p. 15. 


When one makes tables for interval- or ratio-scale data, another dimension 
of complexity is added. It is not always clear how one should organize the row 
headings. For example, let us consider summarizing in tabular form the 
commuting distance information on the 180 students of Table 1.2.1. The 
following steps are suggested. After finishing them all, we will retrace the steps, 
perhaps eliminate some, and take shortcuts on others. 


STEP 1. Let us first reorder the commuting distances from the shortest dis- 
tance to the longest distance (Table 2.2.3). We note that the shortest distance is 
0.5 mile and the longest distance is 56.5 miles. We need to make up categories 
of distance so that we have a better understanding of the commuting distances. 




TABLE 2.2.3 Commuting Distances Ordered From Small to Large
(reading down each column in turn)

0.5    2.5    4      5.5    7.5    11     15     20.5   27.5
0.5    2.5    4      5.5    7.5    11.5   15     21.5   27.5
0.5    2.5    4      6      7.5    11.5   15     21.5   28.5
0.5    3      4      6      7.5    11.5   15     22.5   28.5
0.5    3      4      6      8      12     15     22.5   28.5
0.5    3      4      6      8      12     15.5   22.5   28.5
1      3      4      6      8      12     15.5   23.5   29.5
1      3      4      6      8      12     16.5   23.5   29.5
1      3      4      6      8.5    12     17     24     29.5
1      3      4      6.5    9      12.5   17     24     32
1      3      4.5    6.5    9      12.5   17     24     32
1      3.5    4.5    6.5    9.5    12.5   17.5   24     32
1      3.5    4.5    7      9.5    12.5   19     25     35
1      3.5    5      7      9.5    13     19.5   25     35
1.5    3.5    5      7      10     13     19.5   25.5   36.5
1.5    3.5    5      7      10     13     19.5   25.5   36.5
2      3.5    5      7      10     13.5   20     25.5   36.5
2      3.5    5      7      10.5   13.5   20     26.5   46.5
2      3.5    5      7      10.5   13.5   20.5   26.5   56.5
2      3.5    5.5    7.5    10.5   14.5   20.5   27.5   56.5

STEP 2. The next step is to decide on a method of grouping the data into 
subsets representing intervals on the distance scale. You must be careful in 
selecting the boundaries of the intervals so that every distance in the data set 
falls into one and only one interval. In most cases this is easily done by noting 
the measuring units in which the data are taken and then using the next more 
refined unit of measurement for boundaries. For example, distance here has 
been measured to the nearest one half mile, so the intervals will be constructed 
using one fourth mile in boundaries. 


One further point needs to be considered: how many intervals should be
selected? There is no simple answer to this question. For example, if an honor
society had a rule that only students with at least a 3.00 G.P.A. were eligible to
join, it would divide the data on G.P.A. in Table 1.2.1 into two intervals,
0-2.99 and 3.00-4.00. If the State were interested in stimulating attendance at
the community college by paying some transportation costs according to the
following schedule, the data would be intervalized in accordance with that
schedule. In this way, some idea of the ultimate cost could be developed.


                 State contribution
Miles            ($ per month)
0-4.99                 10
5-14.99                30
15-29.99               60
30-44.99               90
45 and over           120


The chief criterion for selecting the number of intervals should be "How do the
data have to be organized so that I can get some idea of the answer to the main
question being asked about the data?" In cases where there is no basis for such
a decision the usual rule is to take from 8 to 20 equally spaced intervals, with
the criterion being to smooth the picture that would be obtained by plotting the
data. Convenience of tallying data and reading pictured scales is taken into
account as well; thus 5-10-15-20-25 and so on is more convenient intervalizing
than 3-7-11-15-19 and so on.


For the data on distance there are no special conditions to affect our choice 
of intervals. On examining the data, we note that the shortest distance is 0.5 mile
and the longest, 56.5 miles. In order to cover all distances, let us divide 60 
miles into 12 intervals of 5 miles each, starting with 0.25 miles. Then the 
intervals look as follows: 


0.25 5.25 10.25 15.25 20.25 25.25 30.25 35.25 40.25 45.25 50.25 55.25 60.25 
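
As a rough illustration of the boundary rule, the following Python sketch builds the thirteen boundaries listed above and checks that no distance recorded to the nearest half mile can land exactly on one of them. The function names are our own, chosen only for illustration.

```python
def make_boundaries(start, width, count):
    """Return count + 1 equally spaced interval boundaries beginning at start."""
    return [start + width * k for k in range(count + 1)]


def falls_on_boundary(value, boundaries):
    """True if a recorded value coincides with an interval boundary."""
    return value in boundaries


# Twelve 5-mile intervals starting at 0.25 mile, as chosen in the text.
boundaries = make_boundaries(0.25, 5, 12)

# Every value that could be recorded to the nearest half mile, 0.5 to 60.0.
half_mile_values = [0.5 * k for k in range(1, 121)]

# Values that would sit on a boundary and so belong to two intervals at once.
ambiguous = [v for v in half_mile_values if falls_on_boundary(v, boundaries)]

print(boundaries)
print("values on a boundary:", ambiguous)
```

Because the boundaries sit on quarter miles while the data sit on half miles, the list of ambiguous values should come out empty.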


The data are then tallied into the various intervals, one observation at a 
time. Both the tallying and the summarizing of results are best accomplished in 
tabular form: 


TABLE 2.2.4 Commuting Distances (in miles) of Upper-
class College Students


Distance
(miles)          Frequency

0.25-5.25            59
5.25-10.25           38
10.25-15.25          28
15.25-20.25          13
20.25-25.25          16
25.25-30.25          15
30.25-35.25           5
35.25-40.25           3
40.25-45.25           0
45.25-50.25           1
50.25-55.25           0
55.25-60.25           2
Total               180

[The printed table also carries a Tally column showing the tally marks for
each interval.]




If we now use Table 2.2.4 in place of the original raw data, we have 
sacrificed some of the original information. For example, we know that 59 out 
of the 180 sampled students travel between 0.25 and 5.25 miles each way to 
campus, but we do not know anything about how the 59 distances are 
distributed along the interval. We have to assume that they are evenly 
distributed across the interval. So while tables help to clarify and summarize 
information, they do sacrifice some information in the process. 
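
The tallying itself can be sketched in a few lines of Python. The dictionary below condenses Table 2.2.3 into each distinct distance and the number of students reporting it; that condensed form, and the names `counts` and `tally`, are our own devices for illustration and not part of the original tabulation.

```python
from bisect import bisect_right

# Distinct commuting distances (miles) and how many students report each,
# condensed from Table 2.2.3.
counts = {
    0.5: 6, 1: 8, 1.5: 2, 2: 4, 2.5: 3, 3: 8, 3.5: 9, 4: 10, 4.5: 3,
    5: 6, 5.5: 3, 6: 7, 6.5: 3, 7: 7, 7.5: 5, 8: 4, 8.5: 1, 9: 2,
    9.5: 3, 10: 3, 10.5: 3, 11: 1, 11.5: 3, 12: 5, 12.5: 4, 13: 3,
    13.5: 3, 14.5: 1, 15: 5, 15.5: 2, 16.5: 1, 17: 3, 17.5: 1, 19: 1,
    19.5: 3, 20: 2, 20.5: 3, 21.5: 2, 22.5: 3, 23.5: 2, 24: 4, 25: 2,
    25.5: 3, 26.5: 2, 27.5: 3, 28.5: 4, 29.5: 3, 32: 3, 35: 2,
    36.5: 3, 46.5: 1, 56.5: 2,
}

# Interval boundaries 0.25, 5.25, ..., 60.25.
boundaries = [0.25 + 5 * k for k in range(13)]


def tally(counts, boundaries):
    """Count how many observations fall between each pair of boundaries."""
    freq = [0] * (len(boundaries) - 1)
    for distance, n in counts.items():
        # Index of the interval whose lower boundary lies just below distance.
        idx = bisect_right(boundaries, distance) - 1
        freq[idx] += n
    return freq


frequencies = tally(counts, boundaries)
for low, high, f in zip(boundaries, boundaries[1:], frequencies):
    print(f"{low:5.2f}-{high:5.2f}  {f:3d}")
print("Total", sum(frequencies))
```

Once the tally is done, only the twelve frequencies remain; the individual distances inside each interval can no longer be recovered from them, which is the loss of detail described above.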


EXERCISES


2.2.1 Construct a table for showing how the opinion on legalizing marijuana differs
with respect to sex.


2.2.2 Using the data in Table 1.2.1, fill in the following tables by tallying the numbers
of students for the various cells.


Political Party Preference

Party Preference:   Democrat | Republican | Other Party | No Preference


Opinion on Legalizing Marijuana

1          2          3          4          5
Strongly | Mildly   | No       | Mildly   | Strongly
Disagree | Disagree | Opinion  | Agree    | Agree


2.2.3 Based on what you find in the above three tables, discuss whether the data in
these tables support or do not support the following statements:

a. In general, this sample of 180 students shows that young people would like to
see marijuana legalized.

b. While men prefer the legalization of marijuana, women do not.

c. Men are more committed to political parties than women are.

d. This sample has too many men in it; thus any ideas about generalizing should
not be allowed.

e. This sample of 180 students shows some interesting opinions, but I don't
believe that all of the students would be similar to these students.


2.2.4 The following questionnaire was distributed to Rensselaer Polytechnic students
in the spring of 1973 by the (student) Union Programs Activity Committee
(UPAC).


How to get into a movie for nothing. UPAC is publishing the following 
questionnaire in an attempt to find out what the students prefer in entertain- 
ment. Future programming will reflect these opinions. 

Any student bringing a completed questionnaire to Friday night's showing
of "Beneath the Planet of the Apes" will receive free admission for himself
and his date. Other students may return the completed questionnaires to
the boxes at the Union Information desk and elsewhere on campus.


Please circle the correct answer:
1) What school are you from?
RPI
Sage
Other.


2) If from RPI, where do you live?
Dorms
Fraternity
Off-campus


3) Are you a(n)?
undergraduate
graduate


4) Are you?
male
female


5) Are you?
single
married


6) If married, how many children over 2 years of age do you have?
(please fill in number).


7) Did you know what UPAC was before reading this article?
yes
no

8) Do you attend UPAC movies? 
always 
frequently 
infrequently 
never 
9) Did you, in general, like UPAC's movie selection? 
yes 
no 
no opinion 


10) Do you attend UPAC speakers? 
always 
frequently 
infrequently 
never 




11) Did you, in general, like UPAC's speaker selection? 
yes 
no 
no opinion 
12) Do you attend Beer & Flicks?
always
frequently
infrequently
never


13) Do you attend Foreign Films?
always
frequently
infrequently
never


14) Do you attend Coffee Houses?
always
frequently
infrequently
never


15) Do you go to bands in the Rathskeller? 
always 
frequently 
infrequently 
never 


16) Do you attend UPAC Cultural Events? 
always 
frequently 
infrequently 
never 


17) Which did you attend: (Circle those that apply)
J. Geils
Isaac Hayes
Chicago
Chuck Mangione
Gary Burton


18) Have you, in general, been satisfied with UPAC's level of programming: 
yes 
no 
no opinion 


19) If you wanted to suggest some type of program for UPAC, would you 
know where to go or whom to see? 
yes 
no 


[Data sheet: excerpt from a computer printout of the coded questionnaire
responses, one row per student (by Student Number) and one coded entry in each
of columns 1 through 39. The individual entries are not recoverable here.]




20) Circle those areas in which you would like to see more programming. 
Coffee Houses 
Beer and Flicks 
Popular Films 
Foreign Films 
Speakers 
Cultural Events 
Bands 
Big Concerts 


21) How do you find out about UPAC’s events? 
Poly article 
Poly Calendar Events 
Other newspapers, radio or T.V. 
UPAC Calendar of Events 
UPAC Bulletin Board 
UPAC Posters 
Word of mouth 


The data were collected from a total of 550 students. The table on p. 33 is an
excerpt from a computer printout of the data for students Number 284-328
inclusive. Examine the data sheet. Without the coding scheme, can you make any
sense out of the data?


2.2.5 The following coding scheme was used for the computer: 


                                               Column on
Question   Category               Code         Data Sheet
1          RPI                    1            1
           Sage                   2
           Other                  3
2          Dorms                  1            2
           Fraternity             2
           Off campus             3
3          Undergraduate          1            3
           Graduate               2
4          Male                   1            4
           Female                 2
5          Single                 1            5
           Married                2
6          Number of children                  6
7          Yes                    1            7
           No                     0
8          Always                 1            8
           Frequently             2
           Infrequently           3
           Never                  4
9          Yes                    1            9
           No                     2
           No opinion             3
10         coded like 8           recorded in column 10
11         coded like 9           recorded in column 11
12         coded like 8           recorded in column 12
13         coded like 8           recorded in column 13
14         coded like 8           recorded in column 14
15         coded like 8           recorded in column 15
16         coded like 8           recorded in column 16


Question 17        Code                  Column
J. Geils           1 attended            17
                   0 did not attend
Isaac Hayes        1 attended            18
                   0 did not attend
Chicago            1 attended            19
                   0 did not attend
Chuck Mangione     1 attended            20
                   0 did not attend
Gary Burton        1 attended            21
                   0 did not attend


Question 18 coded like question 9 and recorded in column 22; question 19
coded like question 7 and recorded in column 23. Questions 20 and 21 use the
code 1 for yes and 0 for no, recorded in columns as follows:


Question 20                     Column
Coffee houses                   24
Beer & flicks                   25
Popular films                   26
Foreign films                   27
Speakers                        28
Cultural events                 29
Bands                           30
Big concerts                    31


Question 21                     Column
Poly (school paper) article     32
Poly calendar of events         33
Other newspapers                34
Radio or T.V.                   35
UPAC calendar of events         36
UPAC bulletin board             37
UPAC posters                    38
Word of mouth                   39

a. Tabulate the following information: 
Frequency 


Undergraduate 
Graduate 


b. Did the students like or dislike UPAC's movie selections? 


c. What percentage of those who attended UPAC's movies liked the movie
selection?


2.2.6 Examine magazines and newspapers and find three samples of tables of discrete 
data. Identify the kind of scale (nominal, ordinal, interval, ratio). 




2.2.7 Using the data obtained from the survey in Exercise 2.2.5, fill in the following 
table: 


Attendance at Foreign Films 


Living Quarters Infrequently | Frequently Totals 


Dorm 
Fraternity 
Off campus 


Would you be willing to make some statement about the attendance at foreign
films as it relates to where the students live?


2.2.8 Fill in the following table: 


Attendance at UPAC Events 


UPAC Movies 


a. How many students were frequent UPAC movie goers but infrequent
UPAC cultural events goers?
b. How many students never attended movies or cultural events? What
percentage of the total students does this represent?
c. What percentage of the students attended at least one UPAC movie?
d. What percentage of the students attended at least one UPAC cultural event?
e. If one defined UPAC activities as successful if attendance at both movies and
cultural events was either frequent or always, would you say that UPAC was
successful or not as a result of this survey? Why?




2.3 GRAPHICAL PRESENTATION 


The presentation of data in tables can be further clarified to the reader by a 
graphical presentation of the same data. It is the authors’ opinion that any 
interpretation or analysis of data should include a graphical presentation of the 
information. 


A. Graphical Methods for Nominal and Ordinal Data 


The most useful graphical technique for presenting nominal or ordinal scale 
data is the bar chart. 


Bar Chart. A bar chart is a diagram consisting of vertical (or horizontal) 
bars which represent the frequency of observations in specific categories. There 
are several useful guidelines that should be followed in constructing a bar chart: 


1. Generally the bars for categories should be separated so that there is a
distinct uniform space between bars.

2. The bars must all start from the same base.

3. It is not advisable to include figures inside or on top of the bars. This
creates the illusion of shortening or lengthening the bars.

4. Since a graph is drawn for the purpose of enabling the reader to gain a
quick picture of the information in the data, it is advisable when dealing
with nominal data to arrange the bars in ascending or descending order
of magnitude.

5. If more than one color (or shading) is used, a key for colors should be
prominently displayed.

6. Every bar chart should have a title, and if the data are taken from an
external source, the source should be indicated at the bottom of the
chart.
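
As a small sketch of the ordering guideline for nominal data, the Python lines below draw a crude text bar chart with the bars in descending order of frequency. The frequencies are the political party preference counts that appear later in this section; the function name and the choice of one mark per two students are ours, for illustration only.

```python
def text_bar_chart(frequencies, scale=2):
    """Return one line per category, bars in descending order of frequency.

    Each '#' stands for `scale` observations; all bars start at the same base.
    """
    ordered = sorted(frequencies.items(), key=lambda item: item[1], reverse=True)
    width = max(len(label) for label, _ in ordered)
    return [f"{label:<{width}} | {'#' * (count // scale)}" for label, count in ordered]


party = {
    "Democrat": 60,
    "Republican": 50,
    "Other parties": 6,
    "No party preference": 64,
}

lines = text_bar_chart(party)
print("Political party preference, 180 students (each # = 2 students)")
print("\n".join(lines))
print("Source: Table 1.2.1")
```

For ordinal categories the bars would be left in their natural order instead of being sorted.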




Let us take the table constructed on the opinion on legalizing marijuana 
(Exercise 2.2.2) and draw a bar chart for it. 


[Bar chart: number of students (vertical axis) for each category of opinion
(horizontal axis): strongly disagree, mildly disagree, no opinion, mildly
agree, strongly agree.]

Opinion on legalizing marijuana, by intensity of opinion (sample of 180
students).
Source: Sample of 180 upper-class students, Table 1.2.1.


Note: Since the horizontal scale is a set of ordered categories, the ordering of 
the bars by their frequency heights is not possible. 

As a further example of using bar charts, the opinion on legalizing marijuana 
has been further broken down by sex. This is shown in the following bar chart, 
where the categories of opinion are separated into sets of two bars, male and 
female. 




[Bar chart: number of students (vertical axis, 0 to 50) for each category of
opinion (strongly disagree, mildly disagree, no opinion, mildly agree, strongly
agree), with two bars per category, male and female, identified by a key.]

Opinion on legalizing marijuana, by sex and by intensity of opinion (sample of
180 students).

Source: Sample of 180 upper-class students, Table 1.2.1.


Note: Since the horizontal scale is a set of ordered categories, the ordering of 
the bars by their frequency heights is not possible. 


A second graphical technique useful for nominal data in particular is the pie 
chart. 


Pie Chart. The pie chart is a circle in which the component percentages of 
the total sample are plotted by converting them to degrees. In plotting 
categories of nominal-scale data in this manner, there are a few guidelines 
which prove useful: 


1. Start the division of the circle at 12:00 o'clock. The reason for this is 
that people tend to read a clock in a clockwise direction from 12:00 
o'clock. 

2. In order to put the emphasis on that category with the highest frequency
(number of occurrences), the categories are usually plotted in the order
of descending frequencies.


Let us use the data on political party preference to show the construction of 
the pie chart. 




Political Party Preference

                                      Fraction            Degrees
                       Frequency      of Total            (Fraction x 360°)
Democrat                   60         60/180 = 1/3        1/3 x 360° = 120°
Republican                 50         50/180 = 5/18       5/18 x 360° = 100°
Other parties               6          6/180 = 1/30       1/30 x 360° = 12°
No party preference        64         64/180 = 16/45      16/45 x 360° = 128°


Using the degrees in the last column, the pie chart is constructed, starting at 
12:00 o'clock. 
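
The conversion from frequencies to degrees can be checked with a short Python sketch. It uses exact fractions so that the four angles add to a full circle; the function name `pie_degrees` is ours, for illustration.

```python
from fractions import Fraction


def pie_degrees(frequencies):
    """Convert category frequencies to sector angles in degrees."""
    total = sum(frequencies.values())
    return {label: Fraction(count, total) * 360
            for label, count in frequencies.items()}


party = {
    "Democrat": 60,
    "Republican": 50,
    "Other parties": 6,
    "No party preference": 64,
}

degrees = pie_degrees(party)
for label, angle in degrees.items():
    print(f"{label:<20} {float(angle):6.1f} degrees")
print("Sum of angles:", float(sum(degrees.values())))
```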


[Pie chart: sectors for the categories of political party preference, including
No political preference, Democrat, and Republican.]

Political party preference among 180 Mecca Community College students.
Source: Sample of 180 upper-class students, Table 1.2.1.




EXERCISES 


2.3.1 Using the data shown in Exercise 2.2.4, construct: (a) a bar chart showing the
frequency of preference for more UPAC programming in the areas shown in
question 20, and (b) a pie chart showing the attendance at UPAC cultural events.


2.3.2 Using the data in Table 1.2.1, construct a table showing the joint breakdown of
political party preference and opinion on legalizing marijuana. Prepare a
graphical representation of these data.


2.3.3 Construct a graphical presentation of the data from the following table:


Employment of Persons Aged 14 Years and Over as of March 1968

                               Men                      Women
                       Numbers in   Median      Numbers in   Median
                       Thousands    Income      Thousands    Income

Total                    66,519      5,571        73,584      1,819
Employed                 47,622      6,610        27,887      3,157
Unemployed                1,680      3,017         1,332      1,382
Armed Forces or          17,217      1,634        44,365        913
Not in Labor Force


Source: Employment Status and Occupation, Persons 14 years old and over by total money income
in 1967, by sex, for the United States, U.S. Bureau of the Census, Current Population Reports, Series
P-60, No. 60, U.S. Government Printing Office, Washington, D.C. (1969).


2.3.4 Using the following chart, called a segmented bar chart, indicate why you think
the chart was done this way instead of with three bars per year as indicated in
the text. Do you think this is a useful chart?


[Segmented bar chart: percent of total funds (vertical axis, 0 to 100) by
source of funds, Federal among them, with one segmented bar for 1960 and one
for 1965.]

Source of funds for XYZ Health Department, 1960 and 1965.




B. Graphical Methods for Interval and Ratio Scale Data 


When data are recorded using a continuous scale, there are three very useful 
graphical methods: (a) the histogram, (b) the frequency polygon, and (c) the 
cumulative frequency polygon. 


Histogram. The histogram is a bar chart with the bars not separated; the 
bases of the bars are on one continuous scale. 


Let us use the commuting-distance data that we tabulated in the previous 
section to demonstrate the construction of a histogram. The data are repeated 
in Table 2.3.1 for convenience. 


TABLE 2.3.1 Commuting Distances (in miles) of Upper-
Class Community College Students


                                                   Cumulative percentage
                              Cumulative           (in decimal
Interval        Frequency     Frequency            form)
0.25-5.25           59                    59           .33
5.25-10.25          38         59 + 38 =  97           .54
10.25-15.25         28         97 + 28 = 125           .69
15.25-20.25         13        125 + 13 = 138           .77
20.25-25.25         16        138 + 16 = 154           .86
25.25-30.25         15        154 + 15 = 169           .94
30.25-35.25          5        169 +  5 = 174           .97
35.25-40.25          3        174 +  3 = 177           .98
40.25-45.25          0        177 +  0 = 177           .98
45.25-50.25          1        177 +  1 = 178           .99
50.25-55.25          0        178 +  0 = 178           .99
55.25-60.25          2        178 +  2 = 180          1.00
Total              180


STEP 1. Plot the intervals on the horizontal axis of arithmetic line graph
paper.


[Histogram: number of students (vertical axis) against commuting distance in
miles (horizontal axis, marked from 0.25 to 60.25).]

Commuting distance of 180 community college students.


STEP 2. Each bar will be 5 miles in width and have a height equal to the
frequency of that interval. Thus the first bar will begin at 0.25 mile and end at
5.25 miles and have a height of 59, representing the 59 students who commute
between 0.25 and 5.25 miles from their home to school. The entire histogram
is shown above.


Special attention is required in constructing a histogram when the intervals 
are unequal in width. Adjustment of frequency plots must be made to prevent 
misinterpretation. The procedure is best seen in an example, as follows. 


In an article there appears the following table [Auer, Eugene S., Carcinoma 
of the cervix uteri, J.A.M.A. 98 (26), 2260 (1932); the frequencies have been 
modified for ease of computation]. 


Patients Treated for Carcinoma of the Cervix, by Age
Barnard Free Skin and Cancer Hospital, St. Louis, Missouri, 1906-1926


Age        Number of Patients
22-30            16
30-35            45
35-40            79
40-55           225
55-60            63
60-70            46
70-90            12




In creating a histogram for these data, remember our assumption about the 
observations within each interval. In the first age interval (22-30) we have 16 
patients; our assumption is that the 16 patients are spread out evenly over the 
8-year interval. In the age interval 40—55, 225 patients are spread out evenly 
over a 15-year span. In order to plot the data to reflect this we first must 
choose one size interval as a standard on which to base the plotting. Usually 
one plots on the basis of the smallest interval cited; in our example, a 5-year 
age span is the choice for standard. The bar heights are adjusted to conform to 
the standard. For the data of the above table, the adjusted heights of the 
histogram are shown as follows: 


                             Number of Patients
                             for Histogram
           Number of         Plot on a
Age        Patients          5-year Basis
22-30          16            5/8 x 16 = 10
30-35          45            5/5 x 45 = 45
35-40          79            5/5 x 79 = 79
40-55         225            5/15 x 225 = 75
55-60          63            5/5 x 63 = 63
60-70          46            5/10 x 46 = 23
70-90          12            5/20 x 12 = 3
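
The adjustment amounts to multiplying each frequency by the ratio of the standard width to the interval's own width. A minimal Python sketch, with names of our own choosing, reproduces the adjusted column:

```python
from fractions import Fraction

# (age interval, number of patients) from the table in the text.
age_groups = [
    ((22, 30), 16),
    ((30, 35), 45),
    ((35, 40), 79),
    ((40, 55), 225),
    ((55, 60), 63),
    ((60, 70), 46),
    ((70, 90), 12),
]


def adjusted_heights(groups, standard_width):
    """Scale each frequency to what it would be over `standard_width` years."""
    heights = []
    for (low, high), count in groups:
        heights.append(Fraction(standard_width, high - low) * count)
    return heights


heights = adjusted_heights(age_groups, 5)
for ((low, high), count), h in zip(age_groups, heights):
    print(f"{low}-{high}: {count:3d} patients -> height {h}")
```

With heights on a common 5-year basis, the area of each bar (height times width) stays proportional to the number of patients in the interval.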


The histogram correctly plotted now comes out as follows: 


[Histogram of the adjusted frequencies. Vertical axis: frequency (in 5-year age
span), marked from 20 to 80.]


Note that the above picture is the correct one. If you plotted the observed 
frequencies without the adjustment, the picture would be completely mislead- 
ing, since the 225 frequency would incorrectly dominate the picture. Thus, you 
must be very careful when drawing graphs of data which are collected in tables 
with unequal intervals. 


The next logical kind of graph is one that replaces the histogram with a line 
graph called the frequency polygon. 


Frequency Polygon. The frequency polygon is constructed by connecting
the midpoints of the tops of the histogram bars with straight lines and
then removing the bars.


[Figures: construction of the frequency polygon for the carcinoma data.
Vertical axis: frequency (in 5-year age span); horizontal axis: age.]


Obviously you don't have to go through the histogram routine if all you are 
interested in is the frequency polygon; the required points can be plotted 
without drawing and erasing bars. 


The frequency polygon is the graphical method most often used for plotting 
continuous data. Please remember, however, that our polygon example has 
been drawn taking into account unequal intervalized data. Otherwise it too 
would have been as misleading as an incorrectly drawn histogram. 
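
To plot the polygon directly, one needs only the midpoint of each interval and the adjusted height above it. The sketch below computes those points for the carcinoma data; it is an illustration under our own naming and leaves the drawing itself to graph paper or a plotting tool.

```python
# (age interval, number of patients) from the carcinoma table.
age_groups = [
    ((22, 30), 16),
    ((30, 35), 45),
    ((35, 40), 79),
    ((40, 55), 225),
    ((55, 60), 63),
    ((60, 70), 46),
    ((70, 90), 12),
]


def polygon_points(groups, standard_width):
    """Return (midpoint, adjusted height) pairs for a frequency polygon."""
    points = []
    for (low, high), count in groups:
        midpoint = (low + high) / 2
        # Put every interval on the same basis before plotting.
        height = count * standard_width / (high - low)
        points.append((midpoint, height))
    return points


points = polygon_points(age_groups, 5)
for midpoint, height in points:
    print(f"age {midpoint:5.1f}: height {height:5.1f}")
```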


Cumulative Frequency Polygon. А very useful graph for many purposes is 
the cumulative frequency polygon. To construct this type of graph we must first 
calculate the cumulative frequencies. Examples are shown in the third column 
in Table 2.3.1. 


For example, 59 students commuted less than 5.25 miles, 97 students 
commuted less than 10.25 miles, that is, the 59 of the 0.25-5.25 interval plus 
the 38 who commuted anywhere from 5.25 to 10.25 miles. Since the cumula- 
tive frequency cumulates to the end of the interval, one plots the cumulative 
frequency polygon by taking points at the ends of the intervals. 
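
The running totals are easy to reproduce. The following Python sketch accumulates the frequencies of Table 2.3.1, pairs each total with the upper end of its interval, and also divides by the sample size to give the decimal fractions in the table's last column; the variable names are ours.

```python
from itertools import accumulate

# Interval frequencies from Table 2.3.1.
frequencies = [59, 38, 28, 13, 16, 15, 5, 3, 0, 1, 0, 2]

# Upper ends of the intervals: 5.25, 10.25, ..., 60.25.
upper_ends = [5.25 + 5 * k for k in range(len(frequencies))]

# Running totals, each one counting every student below that upper end.
cumulative = list(accumulate(frequencies))
total = cumulative[-1]

# Cumulative totals as decimal fractions of the sample, to two places.
cumulative_fraction = [round(c / total, 2) for c in cumulative]

# Points of the cumulative frequency polygon: (end of interval, running total).
points = list(zip(upper_ends, cumulative))

for end, c, frac in zip(upper_ends, cumulative, cumulative_fraction):
    print(f"less than {end:5.2f} miles: {c:3d} students ({frac:.2f})")
```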


Cumulative Percentage Polygon. Another very useful technique is to plot 
the cumulative percentages rather than the cumulative frequencies. The cumula- 
tive percentages for our distance data are shown in column four of Table 2.3.1. 
Plotting at the ends of the intervals, the cumulative percentage polygon is 
shown in Figure 2.3.1. 


[Figure: cumulative percentage in decimal form (vertical axis, from 0.00)
against commuting distance in miles (horizontal axis, marked at 0.25, 5.25,
10.25, and so on to 60.25).]


FIGURE 2.3.1 Cumulative percentage polygon of commuting distances (miles) of
upper-class community college students, Mecca Community College.


By using this graph we can read off any percentage point we would like. For 
example, we can find the 50th percentage point. Tracing across from .50 to the 
polygon, and then following a vertical straight line from that point to the 
horizontal axis, we find that 50 percent of the students in our sample of
n = 180 travel less than 9 miles and the other 50 percent of the students travel
more than 9 miles.


We can also determine a completion to the statement: 95 percent of the 
students travel no more than ? miles. Reading this off the curve, we find it to 
be 33 miles. 
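
Reading a percentage point off the polygon is the same as interpolating along the straight line segment that crosses the chosen height. The sketch below does this arithmetically, on the assumption (made throughout this chapter) that observations are spread evenly within each interval; the function name is ours. The computed values, roughly 9.3 and 32 miles, are close to the readings of 9 and 33 miles taken by eye from the graph.

```python
def value_at_fraction(boundaries, frequencies, fraction):
    """Distance below which the given fraction of the observations lies,
    found by straight-line interpolation along the cumulative polygon."""
    total = sum(frequencies)
    target = fraction * total
    below = 0  # cumulative frequency at the lower end of the current interval
    for low, high, count in zip(boundaries, boundaries[1:], frequencies):
        if count > 0 and below + count >= target:
            return low + (high - low) * (target - below) / count
        below += count
    raise ValueError("fraction must lie between 0 and 1")


boundaries = [0.25 + 5 * k for k in range(13)]
frequencies = [59, 38, 28, 13, 16, 15, 5, 3, 0, 1, 0, 2]

median_distance = value_at_fraction(boundaries, frequencies, 0.50)
p95_distance = value_at_fraction(boundaries, frequencies, 0.95)

print(f"50 percent travel less than about {median_distance:.1f} miles")
print(f"95 percent travel no more than about {p95_distance:.1f} miles")
```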




EXERCISES 


2.3.5 In an article [Welton, D. G., Inside dermatology, U.S.A.—from a national 
survey of private office practice, South. Med. J. 53 (2), 210-223 (1960), Table
2], the following data on patients were shown.


Age and Sex of Patients Seen During Average 4- 
Week (Interrupted) Period 


Age Male Female Total 
Under 10 16 19 35 
10-19 41 59* 100 
20-39 62 100 162 
40-49 34 45 79 
50-69 55 66 121 
70+ 16 16 32 
Total 224 305 529 


* Number modified for our purpose 


a. Draw a histogram for each sex showing the age distribution of patients. 

b. Using a frequency polygon, show the age distribution of the total patient 
load. 

c. Using a cumulative percentage polygon, determine the following: (a) 
under what age 90 percent of all patients are, (b) below what age 90 percent 
of all women patients are, and (c) what age category is the most prevalent 
(justify your answer). 

2.3.6 In the example on carcinoma of the cervix (page 44), the author stated “... it
will be noted that almost a majority of the patients fall into the age group
40-55, commonly known as the cancer age.”
a. Discuss the correctness of this statement, using the numerical data and the
graphs that have been constructed in the text.
b. Is the author’s age grouping meaningful? Why?
c. Is the grouping of ages convenient for rapid comparisons by age? If not,
what would you recommend?




2.3.7 In a study [Greene, G. R., and Sartwell, P. E., Oral contraceptive use in
patients with thromboembolism following surgery, trauma, or infection, Am. J. 
Public Health 62 (5), 680-685 (1972), Table 2], the age at hospitalization 
distribution of the cases and controls responding to a questionnaire were shown 
as follows: * 


Age at 
Hospitalization | Responding 


a. Using graphical methods, compare the age distribution of the case respond- 
ents to the control respondents. 

b. Compare the age distribution of cases and controls among the nonrespond- 
ents. 

c. Is there a difference between cases and controls relative to their disposition
to respond to the questionnaire? Justify your response graphically. 


2.3.8 In the study referred to in Exercise 2.3.7 above, another table showed the 
"education-grade completed" for the people responding: ** 


Education Grade
Completed          Cases    Controls

8 or less              5           1
9-11                   4           4
12                    14          21
13-15                 22          20
16                     6           8
17 or more             8           6


a. Show in graphical form the education relationship tabulated here.
b. Is there anything misleading in these tables? State why your graph or graphs
clarify the situation.


* Copyright © 1972 by the American Public Health Association, Inc. Reprinted by permission
of the author and by the publisher.


** Copyright © 1972 by the American Public Health Association, Inc. Reprinted by permission 
of the author and publisher. 




2.3.9 In the Chicago Tribune (Monday, May 8, 1972, Section 1, p. 2), the following 
table on West Side Poverty was printed. 


Poverty On The West Side
(U.S. Census Bureau Study of Low-Income Areas a)

                         Families with Male Head       Families with Female Head
                         Negro    Latin    White       Negro    Latin    White

Total persons           65,146   27,557   56,169      84,087   24,059   55,213
Those with incomes
  below poverty level   13,741    4,277    9,429      28,400    6,101   11,036
Percent below
  poverty level            21%      16%      17%         34%      25%      20%
Poverty level
  unemployed             6,971    1,340    2,865       6,543    1,112    1,355


a The area surveyed and reflected in the statistics included most of the West Side as well as some
economically depressed areas of the North Side in which large numbers of Negroes, Latins, and persons of
Appalachian background live. The survey was taken following the 1970 census.


a. Using the data in this table, present a picture of the poverty on the West 
Side of Chicago by graphical techniques. Use more than one kind of graph. 
b. What can you say about the growing poverty problem on the West Side? 


2.3.10 The following graph appeared in a newspaper article. 


[Graph not reproduced. The vertical axis is marked from 24,000 to 35,000 in steps
of 1,000; the horizontal axis is marked with the years 1964 through 1971.]


a. What conclusions can you draw from this graph? 
b. Criticize the graph. State its limitations. 
c. What recommendations would you make to improve the graph? 


2.3.11 The following graph on a "Baseball Team Budget" appeared in the New York
Times on April 9, 1972.*

[Figure not reproduced: two pie charts, "Where it comes from" and "Where it
goes," captioned "Anatomy of a baseball-team budget." Segment labels legible in
this copy include club expenses (salaries, etc.), stadium costs, farm operations,
player bonuses, promotion, advertising, radio-T.V., road receipts, food and
souvenirs, and miscellaneous. The percentages cannot be matched reliably to the
segments.]

a. Discuss the positive and negative aspects of this graphical presentation.
b. Show how you would have drawn the two graphs, and justify any changes
you made.

2.3.12 The following table shows the progress of a major U.S. corporation.


                               1974         1973
Three Months to June 30
  Sales                    $447,700     $384,800
  Net income                 25,359       24,173
  Earnings per share           1.22         1.17
Six Months to June 30
  Sales                     834,000      755,000
  Net income                 44,869       48,154
  Earnings per share           2.16         2.32

Sales and net income are in thousands of dollars. The average number of common
shares outstanding is 20,785,858.


a. Draw a graph showing the rise in earnings per share for this corporation in
these time intervals.
b. If you were a stockholder, would you be concerned about the progress of
this company? Why?
* © 1972 by The New York Times Company. Reprinted by permission.




2.3.13* The following excerpt from an article, "Continued Job Declines Threaten City
Economy," appeared in the Sunday edition of the New York Times, July 21, 
1974. Using the information in the article, show graphically any facts you 
would like brought out. 


New York City is losing at an accelerating rate the jobs that sustain its economy 
and its government, and the declines are causing growing concern among city 
officials, private economists, business men and labor leaders. 


There were, on the average, 24,000 fewer jobs in the first four months this year 
than last year. 


Leading the declines are jobs in manufacturing. These are just the jobs that are 
most needed here to provide entry opportunities for the city's increasing popula- 
tion of poor and unskilled Puerto Ricans, blacks and other minorities. 


Though other American cities also are losing jobs, and some of them at a faster 
rate than New York, the problem here is more serious because New York is 
bigger and therefore is losing more—251,000 jobs in the last four years. 


The decline, which manifests itself in empty lofts and factory buildings, a high 
rate of unemployment (7 percent locally compared with 5.2 percent nationally) 
and a huge burden of welfare dependency, is confirmed by an array of statistical 
evidence that has been pointing downward since 1969. 


After a decade of employment growth in the nineteen-sixties, the city lost 
53,000 jobs in 1970, 135,000 in 1971, 49,000 in 1972 and 14,000 in 1973. The 
losses wiped out all the gains achieved in the previous decade. 


By far the largest part of the decline—169,000 jobs—was in manufacturing 
employment, which had been falling continually through the fifties and sixties as 
well. However, in those years, the manufacturing losses were made up and even 
exceeded by growth in office work, services and government employment. 


2.3.14 The following data were published by Rinfret Boston Associates, Inc., 1974, 
Prices and Production, An Economic Analysis of Softwood Lumber and 
Plywood, 1970—73, page 54, showing the distribution of softwood in 1970.** 


                              Lumber       Plywood
                             (percent)    (percent)

Residential
  construction market            50           59
Nonresidential market             9           11
Industrial market                20           16
All other markets                21           14


Organize these data into pie charts. 


* © 1974 by The New York Times Company. Reprinted by permission.
**Reprinted with permission of the North American Lumber Association, Inc. 




2.3.15 In the pamphlet “Pneumoconiosis in Coal Miners,” Public Health Service 
Publication No. 2000, Figure 20 shows a three-dimensional plot of the relation- 
ship between age and years worked underground on roentgenographic findings 
of definite pneumoconiosis among working and nonworking miners. Write 
down your assessment of this graphical technique and draw any inferences from 
the figure you would like to make. 


[Figure not reproduced: three-dimensional plot in two panels, "Working miners"
and "Nonworking miners."]

Roentgenographic findings of definite pneumoconiosis by age and years
underground among working and nonworking miners.

* 25.0: the percentage of 55-64 year old miners who had worked more than 40
years underground and had definite pneumoconiosis.




2.3.16 Using the following data and the previous exercise as an example of a 
three-dimensional plot, make a similar chart relating smoking behavior and 
years worked underground with history of persistent coughing. 


Years          Degree of        Nonsmokers and
Underground    Cough            Previous Smokers    Present Smokers

0-9            None                   182                 233
               <Persistent*            22                  57
               Persistent               5                  35
10-19          None                   177                 297
               <Persistent             20                  74
               Persistent               3                  28
20-29          None                   209                 286
               <Persistent             41                 113
               Persistent              16                  74
30-39          None                   187                 141
               <Persistent             34                  89
               Persistent              16                  45
40+            None                    57                  30
               <Persistent             14                  19
               Persistent               4                  18


* <Persistent = less than a persistent cough


2.3.17 The distribution of the U.S. land mass in January, 1970 is shown in the 
following table taken from the U.S. Department of Agriculture, Outlook for 
Timber in the United States, October 5, 1973, pp. 225-226. Draw a graph or 
chart of these data, explaining why you chose the particular graph you used. 


U.S. Land Mass a
(January 1, 1974)


                          Million Acres    Percent of Total

Crop land                      427.0             18.8
Commercial forest              499.7             22.0
Noncommercial forest           253.9             11.2
  Unproductive                 233.9             10.3
  Productive reserved           17.2              0.8
  Deferred                       2.7              0.1
Other lands                   1089.5             48.0
Total                         2270.1            100.0


a Modified for class use.




2.3.18 The following graph* on the cost of living in ten countries appeared in the 
Saturday Review/ World on July 27, 1974. Interpret in words what you see in 
the graph. Criticize the graph constructively.**


[Chart not reproduced: two series plotted against the years 1958 to 1973. The
left-hand axis shows money supply ($ billions, marked from 200 to 700); the
right-hand axis shows the index of cost of living (1963 = 100, marked from 90 to
150). The legend, headed "Ten Countries," lists Belgium, Canada, France,
Germany, Netherlands, Switzerland, United Kingdom, and United States as far as
it is legible. The chart carries the following inset table.]

         Money supply    Index of cost of living
Year     ($ billions)    (1963 = 100)

1958        $217               90.0
1959         225               91.0
1960         232               92.8
1961         249               94.7
1962         266               97.2
1963         289              100.0
1964         307              102.5
1965         332              105.5
1966         348              109.3
1967         378              112.4
1968         415              117.0
1969         441              112.6
1970         485              129.6
1971         550              136.4
1972         625              142.4


Chart shows total money-supply increase of ten major countries
with corresponding increase in cost of living index, 1958-1973.


* Source: International Monetary Fund Financial Statistics, various issues. Compiled by Sidney 
E. Rolfe from issues of the International Monetary Fund Financial Statistics. 
**Reprinted by permission from Saturday Review/ World, © 1974.




2.3.19 The following graph appeared in the Raleigh, North Carolina News and 
Observer, Sunday, May 6, 1973: 


The chart shows the number of drivers in different age groups involved in 
North Carolina fatal crashes in 1972. 


a. Using this graph, what implications and conclusions would you draw about 
driver's age and fatalities? 


b. How would you correct the graph? What conclusions would then be drawn? 


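[Bar chart "Drivers by age" not reproduced. The entries legible in this copy are
the 15-and-under group (count not legible), 55-64 (174 drivers), and 65-74
(117 drivers).]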

Summarizing Data Numerically


“Tonight, we're going to let the statistics speak for themselves.”
Drawing by Koren; © 1974 The New Yorker Magazine, Inc.


3.1 INTRODUCTION


We began our study with a sample of n = 180 observations on upper-class
students at Mecca Community College. Our objectives are to characterize or
describe these students as best we can, and on that basis to make some
statements about all the upper-class students at Mecca Community College. In
Chapters 1 and 2 we have concentrated on tabular and graphical representations
of our sampled student information. We have recognized that we are restricted
in what we have been able to do because of the kinds of data we have collected.

In this chapter we shall turn our attention to the definition and use of
various statistical measures. These will be numerical quantities aimed at
summarizing significant features of the data. We call any calculated number
coming from a set of sample data a "statistic." Only in this sense does the
singular noun "statistic" appear. If we are careful, these sample measures will
help us to make inferences about all upper-class Mecca students. However, this
will be an ultimate goal of this book, and only the beginnings will be shown
right now.

Our major concern will be with two kinds of statistical characterization of
sample data: (a) measures of centrality, and (b) measures of variability.


3.2 MEASURES OF CENTRALITY 


The use of "average" in everyday conversation abounds. Everybody makes
statements like “Joe's batting average is .303," or “I spent an average of $34 a 
week for groceries for my family of four last year," or “the average income for 
middle-class whites is $8000." All uses of “average” refer to a concept that 
things tend to cluster about some central value. In some fields like science, the 
“average” refers to the “center of gravity.” In statistics, we recognize several 
kinds of "averages"—we have to do this since we deal with all kinds of data 
and, as we have pointed out before, each kind of data requires special ways of 
handling and characterization. Let us define and discuss several of these 
measures of centrality. 




A. Arithmetic Mean 


The most commonly used characteristic of continuous data is the arithmetic
mean. It is the “average” which most of us grow up with: add up all the 
observed values and divide by the number of entries. The arithmetic mean is 
often described as the center of gravity or the balance point for a set of 
observations. So widespread is its use that the adjective “arithmetic” is usually 
omitted these days, and mean is taken to indicate the arithmetic mean unless 
some different adjective is specifically stated. The formula for the arithmetic 
mean is: 


$$\text{(Arithmetic) mean} = \frac{\text{Sum of observations}}{\text{Number of observations}} = \frac{\sum y}{n} \tag{3.2.1}$$


Note in this formula the use of the capital Greek letter sigma “Σ” to denote
"sum of." We shall make free use of this shorthand symbol, and its meaning 
will always be the same. The letter n will also be used consistently to denote 
the number of observations. 


A more precise way to indicate the arithmetical operations used in Formula 
(3.2.1) is to write it as follows: 


$$\bar{y} = \frac{\sum_{i=1}^{n} y_i}{n} \tag{3.2.1}$$


The subscript i on the y indicates which y: $y_1$ is the first y, $y_2$ is the second y,
and so on. The complete $\sum_{i=1}^{n} y_i$ notation then indicates that y is to take successively
each of the values from the first through the nth value and these n values are
then to be added. This complicated notation to symbolize the simple process of 
addition may seem unnecessary at the moment, but overall it is a great time 
and space saver. Let's illustrate this with an example. 


Example 3.2.1 


Suppose we have the following seven sample observations of weights (in 
pounds): 


$y_1 = 1$ pound     $y_2 = 4$ pounds    $y_3 = 6$ pounds    $y_4 = 1$ pound
$y_5 = 6$ pounds    $y_6 = 2$ pounds    $y_7 = 1$ pound


The sum of the seven sample observations is denoted in complete detail by:


$$\text{Total} = \sum y = \sum_{i=1}^{7} y_i = y_1 + y_2 + y_3 + y_4 + y_5 + y_6 + y_7 = 1 + 4 + 6 + 1 + 6 + 2 + 1 = 21 \text{ (pounds)}$$


For the arithmetic mean, we have

$$\bar{y} = \frac{\sum y}{n} = \frac{21}{7} = 3.0 \text{ (pounds)}$$
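
The arithmetic in Example 3.2.1 can also be checked by machine. The following is a minimal Python sketch of Formula (3.2.1); the function name `arithmetic_mean` and the variable names are illustrative assumptions, not notation used in the text.

```python
def arithmetic_mean(observations):
    """Formula (3.2.1): the sum of the observations divided by their number."""
    n = len(observations)
    if n == 0:
        raise ValueError("the mean of an empty sample is undefined")
    return sum(observations) / n


# The seven weights y1, ..., y7 of Example 3.2.1, in pounds.
weights = [1, 4, 6, 1, 6, 2, 1]

total = sum(weights)                    # 21 pounds
mean_weight = arithmetic_mean(weights)  # 21 / 7 = 3.0 pounds

print(total, mean_weight)
```

The same function works for any list of numerical observations; only the list `weights` needs to change.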


A useful way of understanding the arithmetic mean is to consider the 
following weight scale with the number of observations at each weight indi- 
cated and look for a pivot point to balance the scale. If we place a fulcrum 
under the scale so that the weights are balanced, the fulcrum is placed at 3 
(pounds), the center of gravity, or in statistical terminology, the arithmetic 
mean. 


[Figure not reproduced: a scale marked from 0 to 7 pounds, labeled y (pounds),
with the fulcrum under the 3-pound mark.]
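
The balance-point idea can be verified numerically: the distances of the observations lying below the mean add up to the same amount as the distances of the observations lying above it. The short Python sketch below checks this for the seven weights of Example 3.2.1; the variable names are illustrative assumptions.

```python
# The seven weights of Example 3.2.1, in pounds.
weights = [1, 4, 6, 1, 6, 2, 1]
mean_weight = sum(weights) / len(weights)   # 3.0 pounds, the fulcrum

# Signed distance of each observation from the fulcrum.
deviations = [y - mean_weight for y in weights]

# Total pull on the left side (observations below the mean) ...
left_pull = -sum(d for d in deviations if d < 0)    # 2 + 2 + 1 + 2 = 7
# ... and on the right side (observations above the mean).
right_pull = sum(d for d in deviations if d > 0)    # 1 + 3 + 3 = 7

print(left_pull, right_pull)
```

Because the two pulls are equal, the scale balances at 3 pounds. Placing the fulcrum anywhere else would leave one side heavier.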


There is one notable peculiarity of the arithmetic mean: it is strongly 
influenced by a few odd-ball observations. Suppose that the sample of Example 
3.2.1 has one more observation added and the additional observation turns out 
to be $y_8 = 35$ pounds. Then the sum of all y values is 21 + 35 = 56, and
$\bar{y} = 56/8 = 7.0$. Thus the mean has gone from 3 to 7 pounds just because of
one extraordinary observation. You can imagine the change in the mean per 
capita income of your town if the world's richest man should move in. 


The (arithmetic) mean has the great advantages of being easily computed 
and readily understood. You cannot use it, of course, to average hair colors or 
political party preferences. But for numerical observations, it is the average 
that gets first attention. For continuous data, it gives a definite realizable value; 
a mean weight $\bar{y} = 6.849$ pounds can exist. For discrete data, it often has to be
understood as hypothetical: if weekly numbers of telephone calls over a year
give $\bar{y} = 647/52 = 12.44$, we cannot actually have forty-four one-hundredths
of a phone call, except “on the average.” If measurements of opinion on a
one-to-five scale are made and lead to $\bar{y} = 3.78$, we have to say that the mean
is somewhat below scale 4.




(Arithmetic) Mean Summary. The primary characteristics of the (arithme- 
tic) mean are as follows: 


1. It is used for all continuous measurements, and, with restricted meaning, 
for ordinal data. 


2. Each observation (measurement) in the sample is included in the calcula- 
tion. 


3. Extreme (very large or very small) observations have a heavy effect on its 
numerical value. 


B. Median 


Another important measure of centrality is called "the median." If we
arrange the measurements from lowest to highest and select the "middle
one," we have what is known as the median. If we have an odd number of
observations, the median is exact. If we have an even number, we must average
the "two middle" values.

Example 3.2.2 


Using the sample of seven weights in Example 3.2.1, we must first arrange 
the weights from low to high (or vice versa). 


$y_1 = 1$
$y_4 = 1$
$y_7 = 1$
$y_6 = 2$ = middle value
$y_2 = 4$
$y_3 = 6$
$y_5 = 6$


The value exactly in the middle is $y_6 = 2$ (pounds). Therefore the median = 2
(pounds).


In actual practice we seldom need to carry along the y designations ($y_1$, $y_2$,
$y_3$, etc.) in any calculation or summary. If any designation is called for, it is
more likely to be something for indicating the place of the observation in a
ranking by size. For that purpose, subscripts enclosed in parentheses are used.
For instance, the data of Examples 3.2.1 and 3.2.2 are as follows:


$y_1 = 1 = y_{(1)}$
$y_4 = 1 = y_{(2)}$
$y_7 = 1 = y_{(3)}$
$y_6 = 2 = y_{(4)}$ = middle value
$y_2 = 4 = y_{(5)}$
$y_3 = 6 = y_{(6)}$
$y_5 = 6 = y_{(7)}$


Note what happens to the median here if the outlying $y_8$ is added to the
sample, as we discussed with reference to the mean. We would have:


$y_1 = 1 = y_{(1)}$
$y_4 = 1 = y_{(2)}$
$y_7 = 1 = y_{(3)}$
$y_6 = 2 = y_{(4)}$   } two middle values
$y_2 = 4 = y_{(5)}$   }
$y_3 = 6 = y_{(6)}$
$y_5 = 6 = y_{(7)}$
$y_8 = 35 = y_{(8)}$


The median is now (2+4)/2 = 6/2 = 3.0, not a very large change from what it 
was before the extraordinary observation appeared. 
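
The rule for the median (take the middle value of the ordered data, or average the two middle values when the number of observations is even) can be sketched in a few lines of Python. The function name `median` and the variable names are illustrative assumptions. The sketch also recomputes the mean, so the two averages can be compared before and after the 35-pound observation is added.

```python
def median(observations):
    """Middle value of the ordered data; average of the two middle values if n is even."""
    ordered = sorted(observations)
    n = len(ordered)
    if n == 0:
        raise ValueError("the median of an empty sample is undefined")
    middle = n // 2
    if n % 2 == 1:
        return ordered[middle]
    return (ordered[middle - 1] + ordered[middle]) / 2


weights = [1, 4, 6, 1, 6, 2, 1]   # Example 3.2.1, in pounds
with_outlier = weights + [35]     # the same sample plus the 35-pound observation

median_before = median(weights)                       # 2
median_after = median(with_outlier)                   # (2 + 4) / 2 = 3.0
mean_before = sum(weights) / len(weights)             # 3.0
mean_after = sum(with_outlier) / len(with_outlier)    # 7.0

print(median_before, median_after, mean_before, mean_after)
```

The single extreme observation moves the median by one pound and the mean by four pounds.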


For any set of observations, the median can be obtained from the cumulative 
percentage polygon (Section 2.3) as the value on the horizontal axis deter- 
mined by the 50 percent point on the polygon. You will recall that we did 
this in Figure 2.3.1. 


Median Summary. The primary characteristics of the median are as follows: 


1. The median can be used with the ordinal type of discrete data and with
both types of continuous data (interval and ratio scale).


2. Once the data have been ordered from either low to high or high to low,
the median ignores all the observations except the one (or two) in the
middle of the ordered array.


3. Extreme observations have very little effect on the median.


4. The median is the best average to use for any characteristic which by its
nature in an entire population has many more extreme values on one side
of the arithmetic mean than on the other. Such a distribution is said to be
badly (or severely) skewed. Family income, per capita wealth, and annual
wages are common examples of skewed distributions. A few extremely
wealthy people in a community can cause the arithmetic mean income to
be far above the typical income unless we state average in another way.
In such a case, the median income is usually considered the appropriate
measure of central tendency.




C. Midrange 


The midrange is defined as the arithmetic mean of the lowest and highest 
observations in the sample: 


$$\text{Midrange} = \frac{\text{lowest value} + \text{highest value}}{2} \tag{3.2.2}$$


In the first example on weights, the smallest weight is 1 pound and the 
largest weight is 6 pounds. Thus 


$$\text{Midrange} = \frac{1 + 6}{2} = 3.5 \text{ (pounds)}$$
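
Formula (3.2.2) uses only the two end values of the sample, which a short Python sketch makes plain. The function name `midrange` is an illustrative assumption. The second call reuses the 35-pound observation discussed earlier for the mean and the median, to show how far one extreme value moves this statistic.

```python
def midrange(observations):
    """Formula (3.2.2): the mean of the lowest and highest observations."""
    return (min(observations) + max(observations)) / 2


weights = [1, 4, 6, 1, 6, 2, 1]   # Example 3.2.1, in pounds

midrange_before = midrange(weights)          # (1 + 6) / 2 = 3.5
midrange_after = midrange(weights + [35])    # (1 + 35) / 2 = 18.0

print(midrange_before, midrange_after)
```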


Midrange Summary. The basic characteristics of the midrange are as fol- 
lows: 


1. The midrange is used only with interval or ratio scale data. 


2. It is an easily determined value and is a very efficient estimate when the 
sample size is small. 


3. Extreme observations have a large effect on the midrange. 


D. Mode 


The most frequently occurring value among a group of sample observations 
is called the mode. It appears as a peak in the graph of the frequency 
distribution of observations. 


In viewing distributions, it is wise to ascertain that only one peak occurs. If 
two peaks occur, any single measure of central tendency may give a misleading 
impression by implying that it describes the more common single-mode type of 
data. We call such a two-peaked distribution “bimodal.” 


Example 3.2.3 


Using the seven sample weights of Examples 3.2.1 and 3.2.2, we see that the 
1-pound weight occurs 3 times, the 6-pound weight occurs twice, and the
2-pound and 4-pound weights occur only once. Therefore the modal value is 
one pound; thus 


mode = 1 pound 
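
Counting how often each value occurs is easy to do by machine. The Python sketch below uses the standard-library `collections.Counter`; the function name `modes` is an illustrative assumption. It returns a list so that a sample with two equally frequent values reports both of them.

```python
from collections import Counter


def modes(observations):
    """Return every value that occurs with the highest frequency, in increasing order."""
    counts = Counter(observations)
    highest = max(counts.values())
    return sorted(value for value, count in counts.items() if count == highest)


weights = [1, 4, 6, 1, 6, 2, 1]   # Example 3.2.1, in pounds
counts = Counter(weights)         # 1 occurs 3 times, 6 occurs twice, 2 and 4 once

weight_modes = modes(weights)           # [1]
tied_modes = modes([1, 1, 6, 6, 2])     # [1, 6]: two values tie for most frequent

print(counts, weight_modes, tied_modes)
```
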
Mode Summary. The basic characteristics of the mode are as follows: 


1. The mode can be used for nominal, ordinal, interval, and ratio types of 
data. 




2. In frequency distributions like the one determined for “commuting 
distance" in the discussion on histograms, the modal value is the mid- 
point of the interval that contains the greatest number of observations. 


3. Extreme observations have little effect on the mode. 


E. Other Measures of Centrality 


There are two other special kinds of average that are sometimes discussed: 
the geometric mean and the harmonic mean. These have use in certain very 
specialized situations that arise rather rarely in ordinary practice. Since the 
remainder of our text does not utilize these measures, they are omitted from 
the book. 


A Review Illustration. Example 3.2.4. 


The following sample of n = 9 observations was taken from the set of data 
recorded in the table under Summary Exercise 3.5.1. 


$y_1 = 2400$    $y_6 = 3650$
$y_2 = 2750$    $y_7 = 2180$
$y_3 = 2180$    $y_8 = 2000$
$y_4 = 2320$    $y_9 = 2190$
$y_5 = 1930$


Using these data, calculate the following: (a) mean, (b) median, (с) mode, 
and (d) midrange. 


(a) Mean [using Formula (3.2.1)]: 


$$\bar{y} = \frac{\sum y}{n} = \frac{\text{total}}{n} = \frac{21{,}600}{9} = \$2400$$


The center of gravity (the equilibrium point) is $2400. 


(b) Median: 
First arrange the observations in order from lowest to highest. The value of
the observation that is in the center of this ordered set of observations is the
median:


$1930
 2000
 2180
 2180
 2190 = median value
 2320
 2400
 2750
 3650




(c) Mode: 

The observation which occurs most frequently is the modal value. By 
inspecting the above set of data, we find the mode = $2180. (In small samples 
of wide-ranging data, it is highly unlikely that there will be two observations 
alike. In such cases, we have to report that the sample "has no mode.")


(d) Midrange: 
The midrange as defined by Formula (3.2.2) is calculated as follows: 


$$\text{Midrange} = \frac{\text{smallest observation} + \text{largest observation}}{2} = \frac{1930 + 3650}{2} = \frac{5580}{2} = \$2790$$

Discussion of Review Illustration. Analysis of the data has resulted in the 
following: 


1. Mean = $2400 

2. Median = $2190 
3. Mode = $2180 

4. Midrange = $2790 




All these statistics are correct, but if one is determined to make inferences
about the average income level of these counties, a choice among them needs 
to be made. 


The mean is the only statistic that uses all the data. Every data point has 
equal weight in calculating $\bar{y}$. In general, this is the best of all the averages, but
there should be some doubt in this case. Why? There is one very large 
income-data point, namely 3650, and this observation is making the mean 
large. 


The median is concerned only with the observation that is in the middle of 
the ordered set of observations. Thus it would make no difference how large 
the largest one was, or how small the smallest; the median would not change. 
For example, suppose that the above data were: 




                     Original     New          New
                     data         data (1)     data (2)

                     $1930        $1930        $ 140
                      2000         2000         2000
                      2180         2180         2180
                      2180         2180         2180
Median                2190         2190         2190
                      2320         2320         2320
                      2400         2400         2400
                      2750         2750         2650
                      3650         6350         2750

Arithmetic mean       2400         2700         2090


In all these cases the median remains unchanged, while the arithmetic mean 
changes a great deal. 
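
The comparison among the three data sets can be reproduced with Python's standard `statistics` module. This is a minimal sketch: the dictionary keys and the helper `midrange` are illustrative assumptions, and the mode and midrange are included only to round out the four measures of the review illustration.

```python
import statistics


def midrange(values):
    """Formula (3.2.2): the mean of the smallest and largest observations."""
    return (min(values) + max(values)) / 2


# The three columns of the comparison, in dollars.
samples = {
    "original": [1930, 2000, 2180, 2180, 2190, 2320, 2400, 2750, 3650],
    "new data (1)": [1930, 2000, 2180, 2180, 2190, 2320, 2400, 2750, 6350],
    "new data (2)": [140, 2000, 2180, 2180, 2190, 2320, 2400, 2650, 2750],
}

summary = {
    name: {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "midrange": midrange(values),
    }
    for name, values in samples.items()
}

for name, measures in summary.items():
    print(name, measures)
```

The median stays at 2190 in every column while the mean moves from 2400 to 2700 and to 2090. The midrange, which depends only on the two end values, moves even more.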


The mode is an interesting statistic but is concerned with only the most 
frequent observation. This is not a useful statistic in small samples. The concept 
of modality is important when we discuss large distributions. 


The midrange uses only the two extreme observations and is thus subject to 
extremeness in small samples, particularly here. 


Thus the choice is between the mean and the median. In income statistics the 
median is usually used since it is deemed to be more representative of the 
underlying income distribution. However, if you are truly interested in a center 
of gravity type of statistic, the mean would be used. 




EXERCISES


3.2.1 Twenty bottles of 14-ounce “Misty” mouthwash were taken from a grocery
store shelf, and the amount of liquid in each bottle was measured in cubic
centimeters (cc) and recorded as follows:


Sample Volume (cc) Sample Volume (cc) 


1 423 11 422 
2 426 12 422 
3 421 13 426 
4 428 14 422 
5 420 15 430 
6 418 16 425 
7 423 17 426 
8 426 18 420 
9 427 19 418 
10 428 20 421 


a. Determine the (arithmetic) mean volume in these 20 bottles.
b. Determine the median volume of the 20 bottles.
c. What is the modal volume of the 20 bottles?
d. Calculate the volume that represents the midrange of the 20 bottles.
e. As a result of the above calculations, what can you say about the symmetry
of the volumes of the liquid in the 20 bottles?


3.2.2 A chemist was working with a new chemical reaction, and he was concerned
about the length of time it took for the reaction to be complete. He decided to
take 12 different samples of raw material and time the length of reaction for
each sample. He recorded (columnwise) the following times:


17 12 11 10 10 9 
14 11 10 10 9 9 


a. What was the average time to completion? Discuss your answer 
and justify whatever measure of central tendency you would use. 

b. If the data above were recorded in the same order that the samples were 
used, would you be concerned about the data? Why or why not? 




3.2.3 The amount of total synthetic detergent (percent) in a soap product can be 
measured through a chemical analysis known as “cationic” SO₃ titrations. The
results obtained from 24 samples of a particular detergent from a store shelf 
were as follows: 


36.8   36.8   36.3   36.1   35.0
36.6   35.3   36.4   36.4   35.4
36.8   36.2   37.5   36.2   35.4
36.8   36.9   36.3   36.1   35.3
36.1   36.3   36.3   35.6


a. Determine the arithmetic mean of the data.
b. Determine the median.
c. Determine the mode.
d. If the manufacturer claimed that his product contained less than 38 percent,
would you be concerned about his claim?
e. If the manufacturer claimed that his product contained less than 37 percent,
would you be concerned about his claim?
f. Do you think it is possible for a manufacturer to guarantee that every box of
detergent he produces has less than 37 percent TSD? (Assume, for discussion
purposes only, that the product must have 35 percent TSD in order to
be a “good” cleaner.)


3.2.4 Each sales district of a major sweater company reports its sales figures at the 
end of each quarter. The company has established a sales quota (target) for 
each district for each of the quarters of the year. The following data represent 
the position of each sales district relative to its quota as of July 1, 1971. 


District    Percentage of    District    Percentage of
Number      Quota            Number      Quota

 1             82              11           105
 2             95              12           110
 3            115              13           126
 4            104              14           110
 5             96              15           109
 6            118              16            88
 7            110              17            96
 8             98              18           102
 9             84              19            98
10            104              20           108


a. Summarize the sales status of the company for the sales vice-president. 
b. What proportion of the sales districts had met their established quota? 
c. What proportion of the districts were doing better than 110 percent of their
quota?


d. If the company's sales quota for the half year was 100,000 sweaters, 
assigning 5000 to each district, did the company meet its goal? How many 
sweaters did the company sell? Suppose that each of the eight districts that 
sold 108 percent of quota or better had a quota of 2000 sweaters, while 
each of the other 12 districts had a quota of 7000. Then what was the total 
of the company's sales? What warning do these results give you about 
totaling or averaging percentages? 


3.2.5 A new liquid fabric finish is sold in a plastic bottle with a cap measured to hold
about 1.4 ounces of liquid. The instructions on the bottle state that one capful
is needed for a normal load of wash. Suspecting that some housewives spill in
extra product, while others do not use enough, the manufacturer obtained a
sample of usage measurements for ten housewives.


Ounces of Product Used


1.1    1.4
0.8    1.2
1.2    1.6
1.0    1.3
0.9    1.5


a. As a result of these 10 measurements, what conclusions would you draw
about the usage of the product?

b. Would you conclude that housewives do not follow directions? Why or why
not?

c. How many usage measurements would you feel necessary before you would
be willing to state that housewives, on the average, use more product than
the instructions state?


3.2.6 The American Kennel Club (A.K.C.) registers over 1,000,000 pure-bred dogs
each year. There are 116 different breeds recognized by the A.K.C. Each breed 
is ranked according to the number of dogs that are registered. In 1971 the toy 
breeds had the following ranks: 


1 41 91 

6 42 100 
8 60 109 
18 67 53 
17 84 45 
26 87 


a. What is the mean rank of the toy breeds in 1971? 

b. What is the mode? What is the median? 

c. Which statistic would you use to represent the average ranking of toy
dogs in 1971? Why? 




3.2.7 During the week ending January 30, 1971, the fifteen most active stocks on the 
New York Stock Exchange reported the following information: 


Most Active Stocks 


Company Volume Final Price Net Change 

East. Air Lines 1,014,600 208 *2i 
Texaco 987,400 34 + ё 
Sperry Rand 958,300 28$ +13
Fed. Nat. Mtg. 886,200 61 –2 

Trans. W. Air. 836,600 18 +13 
Am. Airlines 813,500 28i +1 

Nat. Cash Reg. 751,200 393 + 
Pan Am. 718,500 163 41
Tex. Gulf Sul. 616,500 21 +25 
A.T. & T. 559,700 53; +14
I.T. & T. pf N 554,500 69 +3
Occidental Pet. 552,500 183 +14 
Gulf Oil 529,200 29% - à 
Nwst. Airlines 518,700 278 42i 
Mad. Sq. Gar. 505,300 Ба +1 


a. Calculate the arithmetic mean and the median for volume, last price, and 
net change. 

b. Do you believe that these 15 stocks would represent the general condition 
of the market during this week? Why or why not? 

c. Obtain a copy of the Sunday New York Times and determine last week's 
most active stocks. Do the same calculations that were done in (a).
What conclusions can you make? 


3.2.8 The New York Times Weekly combined averages for the New York Stock 
Exchange are published each Sunday in the Times. One is shown on page 73. 
a. Using the data shown on the graph, answer the following questions: 

i. What percentage of the days had a closing price closer to the low for 
the day than to the high for the day? 
ii. In the six lowest sales-volume days, does the closing price tend to be 
closer to the low or to the high for the particular day? 
iii. Using this graph, estimate the average daily sales volume. What statistic 
would you use? 
iv. What was the average daily closing price on the New York Exchange 
in the months shown? 
b. If you had all the measures of central tendency calculated, would these be 
sufficient to describe these data? Why or why not? 


72 m SUMMARIZING DATA NUMERICALLY 


3.2.9 A food company produced a new product called “Pizza Sticks," which were 
really baked shells filled with pizza. A sample of nine baked shells (empty) were 
taken from each of two different production lines and each shell was weighed. 
The weights were recorded as follows: 


Weights in Ounces

Line Number 1        Line Number 2

1.14    1.22         1.27    1.22
1.28    1.17         1.33    1.19
1.23    1.16         1.20    1.30
1.20    1.30         1.22    1.22
1.22                 1.25


a. Calculate the average weight produced on line 1 and on line 2 using the 
following statistics: (i) arithmetic mean (ii) median, and (iii) mode. 

b. Based on the results obtained in (a), can you say that the two lines are 
producing pizza stick shells with the same weight? Why or why not? 

c. What is the average weight of the pizza shells using the data from both 
lines? Do this calculation two different ways using the arithmetic mean. 


3.2.10 The annual “gross” incomes of 11 families were recorded as follows: 


19,000 10,500 
16,000 10,500 
14,500 10,500 
13,500 10,500 
13,500 8,500 
10,700 


a. Use the following statistical measures to determine an average income for 
the 11 families: (i) arithmetic mean, (ii) median, (iii) mode, and (iv) 
midrange. 

b. Discuss these four results and indicate why each might be useful to the
reader.


[Figure: New York Stock Exchange composite index, 1975, showing each day's high,
closing, and low, together with daily sales in millions. Wednesday, December 10,
1975. Sales: day's 15,680,000; Tuesday 16,040,000; year ago 15,700,000;
year to date 1975 4,476,074,958; 1974 3,298,680,302.]




3.2.11 The number of live-birth certificates signed by Certified Nurse-Midwives 
(CNM) in New York City in the years 1959-1968* is shown below: 


1959 10 
1960 383 
1961 912 
1962 1610 
1963 1493 
1964 1861 
1965 1819 
1966 1852 
1967 2278 
1968 2225 


a. What is the total number of live-birth certificates signed by CNM in the
10-year interval 1959-1968?

b. What is the average number signed per year? Of what use is this statistic?

c. What information of more practical use could be obtained from these data?

d. Tabulate the cumulative numbers of signed certificates in these ten years.

e. Plot the data as both a frequency polygon and cumulative frequency
polygon.


3.2.12 In the same article indicated in the preceding exercise, a table reporting percent 
of live-birth certificates signed by CNMs, 1959-1968, was shown as follows: 


1959 .01 
1960 .23 
1961 .54 
1962 .98
1963 .89
1964 1.12 
1965 1.15 
1966 1.21 
1967 1.56 
1968 1.57 


a. Plot these percentages on a graph.

b. Compare this graph with the frequency polygon in the preceding exercise. 
Explain why, when the percent of live-birth certificates signed by CNMs in 
1968 increased from 1.56 to 1.57, the actual number of certificates signed 
by CNMs decreased from 2278 to 2225. 


* Summary of Vital Statistics, The City of New York, C.N. Y. Department of Health, 1959- 
1968. 




3.2.13 A table in a book [Cugliani, A., and Marano, P., Heart, Cancer, Stroke, and 
Related Diseases (1968), p. 48 (excerpted here for pedagogical use)] shows the 
following: 


Deaths from Cancer, New York City, 1949-1967

            Total Number of
Year        Cancer Deaths
1967        17,788
1966        17,769
1965        17,402
1964        17,642
1963        17,254
1962        17,252
1961        17,384
1956-60     16,869
1952-55     16,553
1949-51     15,556


Source: Vital Statistics, 1949-67, New York 
City Department of Health. 


a. What is the mean yearly total number of cancer deaths in these data? 
b. Do you think this mean is a useful statistic? Why or why not? 

c. Plot these data using a frequency polygon with the year on the x-axis. 
d. What conclusions can you draw from these data? 


3.2.14* In an article [Robertson, Robert L., Economic effects of personal health 
services: Work loss in a public school teacher population, Am. J. Public Health 
61 (1) 30-45, (1971), Table 1] are shown the mean days of work loss 
(rounded) in the study year 1966-1967 for a public-school teacher population 
by age, sex, and health plan. For our purposes the table has been modified. 


                 Males                       Females

          Blue      Group             Blue      Group
Age       Type      Practice          Type      Practice

20-24     5.35      3.33              6.07      7.13
25-34     3.68      3.65              6.37      5.10
35-49     4.02      4.18              5.96      5.73
50-59     3.67      3.44              6.77      5.73
60-64     6.94      2.29              7.76      7.88


*Copyright © 1971 by the American Public Health Association, Inc. Reprinted with permission
from the author and publisher.




a. Using a histogram, plot the data for males with a "blue"-type health plan.
b. Using a histogram, plot the data for males with a group-practice health plan.
c. Using frequency polygons, plot on the same graph the data for females with
a blue-type health plan and females with a group-practice health plan.
d. If you wish to compare two health plans, which graphical procedure would
you use?
e. In order to determine the average days of work loss in this study population
of teachers, would you add up all the data in the table and divide by 20?
Why or why not?


3.2.15 In 1969, 18-year-old males were subject to the draft (service in the armed
forces). The order in which they were called up was determined by a "lottery," 
supposedly completely at random. January dates were drawn out in the 
following sequence. 


Lottery               Lottery
Number    Date        Number    Date
 17       Jan. 15     101       Jan. 5
 52       Jan. 25     118       Jan. 23
 58       Jan. 19     121       Jan. 16
 59       Jan. 24     140       Jan. 18
 77       Jan. 28     159       Jan. 2
 92       Jan. 26     164       Jan. 30
186       Jan. 21     280       Jan. 20
194       Jan. 9      305       Jan. 1
199       Jan. 8      306       Jan. 7
211       Jan. 31     318       Jan. 13
215       Jan. 4      325       Jan. 10
221       Jan. 12     329       Jan. 11
224       Jan. 6      337       Jan. 22
235       Jan. 17     349       Jan. 29
238       Jan. 14     355       Jan. 27


a. Calculate the average lottery number for males born in the month of
January.




3.2.16 The average lottery numbers for the 12 months in the 1969 draft lottery were
as follows:

January     201     July        180
February    203     August      173
March       226     September   157
April       204     October     182
May         208     November    149
June        196     December    122


a. Using a bar chart, plot these figures.

b. What conclusions would you draw by inspecting these data?

c. Based on the above monthly averages, what is the grand average of all of
the lottery numbers?

d. What is the true grand average of the lottery numbers? Explain the
discrepancy between this result and the one you obtained in (c).


3.2.17 A manufacturing concern introduced a new product on the market and retail
sales began on June 1, 1970. During the next 20 months the shipments made to
all outlets were watched closely for indications of sales problems. The data
obtained were as follows (figures are sales in thousands of cases):


1970  June         73       1971  April        51
      July         82             May          53
      August       97             June         68
      September   101             July         81
      October      43             August       76
      November     59             September    73
      December     43             October      50
                                  November     53
1971  January      52             December     55
      February     44
      March        44       1972  January      53


a. What is the mean monthly sales figure during this time period?

b. What is the median monthly sales figure?

c. Plot the data using time on the horizontal axis.

d. What statements would you be willing to make about this new product?

e. What do you expect to be sold in the month of February 1972? State your
reasons.




3.2.18 Small transistorized radios were produced in a certain factory. This factory was 
a three-shift operation. Each shift a sample of 90 radios was checked and 
graded on a quality rating a, b, c, d, e, or f, with “a” being first quality and “e”
and “f” being off-quality or rejectable. A certain day gave data as follows:


                            Quality Rating
Shifts                 a     b     c     d     e     f
Day (8-4)             15    25     8     6    18    18
Evening (4-12)         1    29     8    15     7    14
Graveyard (12-8)       8    21    10    27    15     9


a. What statistic or statistics would you use to characterize the shifts?
b. What conclusions would you be willing to make about the relationship 
between quality and shift? Can you be certain about these conclusions? 


3.2.19 In a community in North Carolina, the occurrence of hepatitis cases for 5 years
was reported as follows:


Year                1961    1962    1963    1964    1965
Number of cases        7      10       8      44      11


Comment on each of the multiple-choice completions in the following state- 
ment. The average number of cases as determined by the median of the 5 
years' experience is: (a) 16, (b) 8, (c) 44, (d) 10, and (e) 21. 


3.2.20 Calculate: (a) arithmetic mean, (b) mode, and (c) median for the following data: 


Diastolic Blood Pressure (mm Hg)




3.2.21 Calculate: (a) arithmetic mean, (b) mode, and (c) median for the following data 
on height, reported here to the nearest inch. 


Height (in.) Frequency 


57 2 
58 4 
59 14 
60 41 
61 83 
62 169 
63 394 
64 669 
65 990 
66 1223 
67 1329 
68 1230 
69 1063 
70 646 
71 392 
72 202 
73 79 
74 32 
75 16 
76 5 
77 2 


3.2.22 Isoniazid given orally for 20 weeks after the beginning of infection will prolong 
the life of rats suffering from leprosy. The following data show the survival time 
for a group of 10 rats after such treatment. 


Survival Time (weeks) 


51 




a. Calculate the arithmetic mean, median, and mode of these data. 
b. On the basis of these data, can you say that the use of isoniazid prolongs the 
life of rats suffering from leprosy? Why or why not? 


3.2.23 In a large supermarket, the following sales of ultralarge packages of detergent 
were recorded: 


Week    Sales      Week    Sales

 1      22         13      23
 2      18         14      16
 3      18         15      24
 4      22         16      19
 5      24         17      18
 6      25         18      21
 7      28         19      22
 8      28         20      17
 9      20         21      22
10      19         22      16
11      23         23      17
12      21         24      20


a. Plot the weekly sales figures against week number using a frequency polygon.

b. Calculate the arithmetic mean of each set of 4 weeks' sales (i.e., weeks 1-4, 
5-8,...,21-24). 

c. Plot the means of part b as a frequency polygon. At what values of the
horizontal scale are these means plotted? What assumption is made when 
you plot these means? 

d. Discuss the plotting of weekly sales figures against 4-week average sales 
figures. When would you use both plots? 




3.3 MEASURES OF VARIABILITY 


In the previous section we were concerned with the various methods of 
characterizing the centrality of a set of data. However, many different sets of 
data could have the same measurement of centrality and still be very different. 
For example, while the following four distributions all have the same arithmetic 
mean (i.e., the same balance point), they are very different from one another. 
We say that the dispersion, variability, or variation of the data is different in the 
various sets. Thus if we are to make good predictions or estimates from 
observed data, we need to include in our analysis some measure of the 
dispersion of the data. We shall consider the four main measures of variation in 
current use: (a) range, (b) variance, (c) standard deviation, and (d) coefficient of 
variation. 


[Figure: four frequency distributions with the same arithmetic mean but different dispersion.]


A. Range 


The range is defined as the difference between the largest and the smallest
observation.


Range = largest y - smallest y (3.3.1)




Example 3.3.1 


Using the seven weights used previously, where the largest weight was 6 
pounds and the smallest weight was 1 pound, we calculate: 


Range = 6 - 1 = 5 pounds


The range is very easy to calculate, and it gives us some idea about the 
variability of the data. However, the range is at best a crude measure of 
variation, since it uses only two sample values. It is most useful with small 
samples, of size 10 or less. 
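
As a small check on Example 3.3.1, the range can be computed in a line or two. The sketch below assumes the seven weights are the ones listed in Example 3.3.2 (1, 4, 6, 1, 6, 2, 1 pounds); the function name `sample_range` is ours, chosen only for illustration.

```python
# Seven sample weights (pounds) from the chapter's running example.
weights = [1, 4, 6, 1, 6, 2, 1]


def sample_range(values):
    """Largest observation minus smallest observation (Formula 3.3.1)."""
    return max(values) - min(values)


weight_range = sample_range(weights)
print(f"Range = {max(weights)} - {min(weights)} = {weight_range} pounds")
```

Because only the two extreme values enter the calculation, changing any of the other five weights leaves the range untouched, which is why the text calls it a crude measure.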


B. Variance ($s^2$)


A better way to measure the variability of a set of data is to measure how
each observation in the data set differs from the arithmetic mean $\bar{y}$ and then to
obtain a statistic using these differences so as to reflect an "average" deviation
from $\bar{y}$. Let us begin by determining the deviations of each weight from the
mean weight in the sample of weights we have been using.


Observation    Weight    Deviation
Number         $y_i$     $y_i - \bar{y}$

1              1         -2
2              4          1
3              6          3
4              1         -2
5              6          3
6              2         -1
7              1         -2

Total          21         0


Notice that the total of the column of deviations is zero. This is no mere
coincidence. It will always be true for the sum of such deviations. After all, the
mean $\bar{y}$ is the center of gravity of the set of data, and so the “overs” and
“unders” have to balance out around that center.
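
For readers who like to see the balancing in symbols, one line of algebra is enough. It uses only the definition of the mean, $\bar{y} = \sum y_i / n$, so that $\sum y_i = n\bar{y}$:

$$\sum_{i=1}^{n} (y_i - \bar{y}) = \sum_{i=1}^{n} y_i - n\bar{y} = n\bar{y} - n\bar{y} = 0$$

For the seven weights this reads $21 - 7(3) = 0$.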


Since the sum of the deviations from $\bar{y}$ is always zero, dividing this sum by n
to get an average deviation would always yield zero also. Hence that would be
of no use as a measure of the variability in the data. One mathematical device
that makes sense to get around the difficulty is to square each deviation, add up
the results, take some kind of average, and extract the square root to get back
to the original units of measurement.


There is one special wrinkle that most modern statisticians use here: the 
"average" of the squared deviations is found by dividing the total of those 
squared deviations by (n — 1) instead of n. We will say a few words about this in 
a moment or two. 


The average measure of squared deviation calculated in this way is called the
sample variance, and is indicated by $s^2$. While n is the sample size, the number
$n - 1$ is called the number of degrees of freedom.


$$\text{Variance} = \frac{\text{total sum of the squares of deviations of observations from the mean}}{\text{number of degrees of freedom}} \qquad (3.3.2)$$


Example 3.3.2 
Our sample data on weights now give their sample variance as follows: 


$y$      $y - \bar{y}$    $(y - \bar{y})^2$
 1       -2                4
 4        1                1
 6        3                9
 1       -2                4
 6        3                9
 2       -1                1
 1       -2                4
21        0 (check)       32

$$\bar{y} = \frac{21}{7} = 3.0; \qquad s^2 = \frac{32}{6} = 5.33$$
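
The same arithmetic can be sketched in Python. The variable names are ours; `statistics.variance` from the standard library also divides by n - 1, so it serves as a cross-check on the hand calculation.

```python
import statistics

weights = [1, 4, 6, 1, 6, 2, 1]
n = len(weights)

mean = sum(weights) / n                                        # 21 / 7 = 3.0
deviations = [y - mean for y in weights]                       # these sum to zero
sum_of_squared_deviations = sum(d ** 2 for d in deviations)    # 32.0
variance = sum_of_squared_deviations / (n - 1)                 # 32 / 6

# The standard-library sample variance uses the same n - 1 divisor.
assert abs(variance - statistics.variance(weights)) < 1e-12
print(round(variance, 2))
```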




Intuitive Explanation of Degrees of Freedom (here, n — 1). Throughout 
this book we have considered our observations as a sample (subset) from a 
large population. For example, the 180 upper-class students of Mecca Com- 
munity College are a sample of the total upper-class student body. After the 
sample has been taken, our purpose is to use the sample information we obtain 
to infer something about the larger population. Our thinking goes like this: 


1. We have available a very large population from which to draw our
sample.


2. We decide to take a sample of size n. (The manner in which the sample is 
taken is important and will be discussed later in the book.) 


3. We say, then, that we have n "degrees of freedom" in the sample. Each 
of the n observations represents one "degree of freedom." 


4. Once we have taken the sample and obtained the observations, we decide
to characterize the sample by calculating some statistics. Our idea is to
use the sample statistics to infer about the larger population. The first
sample statistic we calculate is the arithmetic mean ($\bar{y}$), which represents
one degree of freedom (1 d.f.). Now we have $\bar{y}$ and, in essence, only
$n - 1$ d.f. (pieces of sample information) left over to be used to obtain
additional independent statistics.


5. Now, if we use $\bar{y}$ in calculating a new statistic, any averaging we do ought
to be done using $(n - 1)$ d.f. Therefore when we calculate the variance $s^2$,
using $\sum (y - \bar{y})^2$ in the numerator of the formula, any averaging we do
should use $n - 1$ as the divisor. Thus we define the sample variance as in
(3.3.2):

$$s^2 = \frac{\sum (y - \bar{y})^2}{n - 1}$$


The reader is entitled to ask “If the above is an intuitive explanation of 
degrees of freedom, what would a scientific explanation be?" That question 
should be ducked by author and reader alike in the beginnings of statistical 
learning. The answer is involved in mathematical theory and usage connected 
with random behavior of statistics like $s^2$ across the universe of all possible
different samples. We shall say a few words about this in a later chapter, but
any reasonably clear as well as precise explanation does indeed require a level
of study that logically comes after this book. Try to trust us that the long-run
behavior of $s^2$ is generally considered better than it would be with n in the
denominator, and that your discomfort here will be rewarded by certain
comfortable simplifications in procedures later on.
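
One way to glimpse the "long-run behavior" without any theory is to list every possible sample from a tiny population and average the results. The sketch below is only an illustration under assumptions of our own: the seven weights are treated as if they were the whole population, samples of size 3 are drawn with replacement, and every ordered sample is taken to be equally likely. Exact fractions are used so that no rounding clouds the comparison.

```python
from fractions import Fraction
from itertools import product

# Assumption for illustration only: the seven weights ARE the population.
population = [1, 4, 6, 1, 6, 2, 1]
N = len(population)
pop_mean = Fraction(sum(population), N)
# Variance of the whole population (divisor N, since nothing is estimated).
pop_variance = sum((y - pop_mean) ** 2 for y in population) / N


def averaged_squared_deviation(sample, divisor):
    """Sum of squared deviations from the sample mean, divided by `divisor`."""
    m = Fraction(sum(sample), len(sample))
    return sum((y - m) ** 2 for y in sample) / divisor


n = 3
samples = list(product(population, repeat=n))   # all 7**3 = 343 ordered samples

avg_with_n_minus_1 = sum(averaged_squared_deviation(s, n - 1) for s in samples) / len(samples)
avg_with_n = sum(averaged_squared_deviation(s, n) for s in samples) / len(samples)

print(pop_variance, avg_with_n_minus_1, avg_with_n)
```

Across all 343 samples, the divisor n - 1 averages out exactly to the population variance, while the divisor n comes out too small by the factor (n - 1)/n. This is a demonstration for one small case, not a proof.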




C. Standard Deviation (s) 


The variance $s^2$ is in squared units. In our weight example, that means $s^2$ is
in units of (pounds)$^2$: squared pounds! To convert our measure of variability
back into the original units of measurement, we take the square root of $s^2$. This
gives us s, which we call the sample standard deviation, where standard
deviation equals the square root of the variance.


$$s = \sqrt{s^2} = \sqrt{\frac{\sum (y - \bar{y})^2}{n - 1}} \qquad (3.3.3)$$


Example 3.3.3 


Our data on weights gave us (Example 3.3.2) the variance $s^2 = 5.33$. So now
we have the sample standard deviation:


$$s = \sqrt{s^2} = \sqrt{5.33} = 2.31 \text{ (pounds)}$$


The standard deviation is one of the most important statistics used in 
practice. We shall be using it a lot throughout this text. 


D. Coefficient of Variation (C.V.) 


The coefficient of variation expresses the standard deviation as a percentage 
of the arithmetic mean. 


$$\text{Coefficient of variation (percent)} = \frac{\text{standard deviation}}{\text{arithmetic mean}} \times 100$$

$$\text{C.V. (percent)} = \frac{s}{\bar{y}} \times 100 \text{ (percent)} \qquad (3.3.4)$$




Example 3.3.4 


For our sample of weight data (Examples 3.3.1-3.3.3) we collect results to
obtain:

$$\text{C.V.} = \frac{s}{\bar{y}} \times 100 = \frac{2.31}{3.0} \times 100 = 77\%$$

The coefficient of variation is useful in comparing the relative variability of 
different kinds of characteristics. For example, it can be used to compare the 
variability of county income level in a state with the variability of county 
population size. Here one is comparing two different classes of measurement, 
one in dollars and the other in numbers of people. The coefficient of variation 
puts both of these on the basis of variability as a percentage of the mean, thus 
getting an index that is free of the unit of measurement. To avoid ambiguity in 
meaning, the coefficient of variation is best limited to use for data that are 
always positive and measured on a ratio scale. 
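
A short sketch of Examples 3.3.3 and 3.3.4 in Python, using the seven weights again (the variable names are ours):

```python
import math

weights = [1, 4, 6, 1, 6, 2, 1]          # pounds
n = len(weights)

mean = sum(weights) / n
variance = sum((y - mean) ** 2 for y in weights) / (n - 1)
std_dev = math.sqrt(variance)            # back in pounds
cv_percent = std_dev / mean * 100        # a pure percentage, no units

print(f"s = {std_dev:.2f} pounds, C.V. = {cv_percent:.0f}%")
```

Note that the pounds cancel in the ratio, which is what makes the coefficient of variation free of the unit of measurement.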


Review Illustration. Example 3.3.5. 
Consider the data on income from the review illustration in Example 3.2.4: 


y 


2400 
2750 
2180 
2320 
1930 
3650 
2180 
2000 
2190 


Calculate: (a) range, (b) variance, (c) standard deviation, and (d) coefficient of 
variation. 
a. Range:
Range = largest y - smallest y [Formula (3.3.1)]
= 3650 - 1930


Range = $1720




b. Variance:

$$s^2 = \frac{\sum (y - \bar{y})^2}{n - 1} \qquad \text{[Formula (3.3.2)]}$$

$y$       $y - \bar{y}$    $(y - \bar{y})^2$

2400          0                  0
2750      + 350            122,500
2180      - 220             48,400
2320      -  80              6,400
1930      - 470            220,900
3650      +1250          1,562,500
2180      - 220             48,400
2000      - 400            160,000
2190      - 210             44,100

$\sum y = 21,600$     0      2,213,200
$\bar{y} = 2,400$

$$s^2 = \frac{2,213,200}{9 - 1} = \frac{2,213,200}{8} = 276,650$$


c. Standard deviation:


$$s = \sqrt{\frac{\sum (y - \bar{y})^2}{n - 1}} \qquad \text{[Formula (3.3.3)]}$$


Using the result from (b), we have


$$s = \sqrt{276,650} = 525.9753$$
s = $525.98


d. Coefficient of variation:

$$\text{Coefficient of variation (percent)} = \frac{s}{\bar{y}} \times 100 \qquad \text{[Formula (3.3.4)]}$$

$$= \frac{525.98}{2400} \times 100$$

C.V. (%) = 21.92%
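
The four results of this review illustration can be checked with the Python standard library. This is only a sketch; the variable names are ours, and `statistics.variance` and `statistics.stdev` both use the n - 1 divisor of Formula (3.3.2).

```python
import statistics

# Income data (dollars) from the review illustration.
incomes = [2400, 2750, 2180, 2320, 1930, 3650, 2180, 2000, 2190]

income_range = max(incomes) - min(incomes)                 # (a) range
variance = statistics.variance(incomes)                    # (b) variance
std_dev = statistics.stdev(incomes)                        # (c) standard deviation
cv_percent = std_dev / statistics.mean(incomes) * 100      # (d) coefficient of variation

print(income_range, variance, round(std_dev, 2), round(cv_percent, 2))
```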


3.4 SOME COMMENTS ON TERMINOLOGY AND COMPUTATION 


Students often complain that mathematics is a tyranny of exotic words and 
mysterious calculations. The same can be said for statistics. But then the same 
can be said for any subject matter dealing with ideas needing very precise 
definition and capable of quantitative measurement. Exotic words can be very 
helpful when their meaning is clear, and even mysterious calculations are 
welcome if they cut down labor. 


We have introduced a variety of specialized words: mean, median, mode, 
range, variance, standard deviation, and coefficient of variation. Each has a 
precisely specified definition, giving a clearly stated characteristic of a set of 
data. You will find these technical terms useful in summarizing data and in 
communicating results to other investigators who will be using the same 
language. 


In this connection, we should like to introduce you to a very special term 
that gets a great deal of use in a wide variety of ways in statistical analysis. It is 
sum of squares. We say it is a special term because it really does not mean 
strictly what it says. It means sum of squared deviations, the deviations being 
measured between the sample observations and their arithmetic mean. Thus
for observations $y_1, y_2, y_3, \ldots, y_n$ we define:


$$\text{Sum of squares for } y = \sum_{i=1}^{n} (y_i - \bar{y})^2 \qquad (3.4.1)$$


In terms of (3.4.1), the sample variance $s^2$ can be defined as:


$$s^2 = \frac{\text{Sum of squares for } y}{n - 1}$$

Calculations of various sums of squares play a large part in statistical
analysis. While an analyst can always hope to have available a machine which
computes sums of squares automatically, the hope is often not realized. Then
the labor of calculating $\sum (y - \bar{y})^2$ becomes important. You may have already
foreseen from the examples on weights and incomes that computing a sum of
squares can be a big chore. In our examples, $\bar{y} = 3$ exactly, or $\bar{y} = 2400$ exactly,
so that figuring the differences $y_i - \bar{y}$, squaring them, and adding are no big
deals. But you can be sure that in general $\bar{y}$ will come out with decimal places
more numerous than in the observations themselves. And then the deviation-square
task gets tedious. For that reason, statisticians have worked on the
formula with a bit of algebra to come up with some alternative expressions that
avoid taking the individual differences:




$$\text{Sum of squares for } y = \sum (y - \bar{y})^2 = \sum y^2 - \frac{(\sum y)^2}{n} = \sum y^2 - n\bar{y}^2 \qquad (3.4.2)$$
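
The "bit of algebra" behind (3.4.2) is short enough to show. Expanding the square and using $\sum y = n\bar{y}$ gives:

$$\sum (y - \bar{y})^2 = \sum y^2 - 2\bar{y}\sum y + n\bar{y}^2 = \sum y^2 - 2n\bar{y}^2 + n\bar{y}^2 = \sum y^2 - n\bar{y}^2$$

and, substituting $\bar{y} = \sum y / n$ in the last term,

$$\sum y^2 - n\bar{y}^2 = \sum y^2 - \frac{(\sum y)^2}{n}$$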


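In Example 3.3.2 we worked out $\sum (y - \bar{y})^2$ directly, obtaining 32. By (3.4.2)
we have:

$y$     $y^2$
 1       1
 4      16
 6      36
 1       1
 6      36
 2       4
 1       1
21      95

$$\bar{y} = \frac{21}{7} = 3.0$$

$$\sum (y - \bar{y})^2 = 95 - \frac{(21)^2}{7} = 95 - \frac{441}{7} = 95 - 63 = 32$$

or

$$\sum (y - \bar{y})^2 = 95 - 7(3)^2 = 95 - 7(9) = 95 - 63 = 32$$

$$s^2 = \frac{32}{6} = 5.33$$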


In the review illustration of Example 3.3.5, we have $\sum (y - \bar{y})^2 = 2,213,200$.
Use of (3.4.2) gives:

$y$        $y^2$
 2400       5,760,000
 2750       7,562,500
 2180       4,752,400
 2320       5,382,400
 1930       3,724,900
 3650      13,322,500
 2180       4,752,400
 2000       4,000,000
 2190       4,796,100
21,600     54,053,200

$$\bar{y} = \frac{21,600}{9} = 2400$$

$$\sum (y - \bar{y})^2 = 54,053,200 - \frac{(21,600)^2}{9} = 54,053,200 - \frac{466,560,000}{9} = 54,053,200 - 51,840,000 = 2,213,200$$

or

$$\sum (y - \bar{y})^2 = 54,053,200 - 9(2400)^2 = 54,053,200 - 9(5,760,000) = 54,053,200 - 51,840,000 = 2,213,200$$

$$s^2 = \frac{2,213,200}{8} = 276,650$$
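
For those who do have a machine available, here is a sketch comparing the direct definition with the shortcut of (3.4.2). The two function names are ours. One caution we add: on a computer the shortcut subtracts two large, nearly equal numbers, so with long decimal data it can lose accuracy; the direct form is usually the safer one to program, even though the shortcut is the easier one by hand.

```python
def sum_of_squares_direct(values):
    """Sum of squared deviations from the mean, as in (3.4.1)."""
    mean = sum(values) / len(values)
    return sum((y - mean) ** 2 for y in values)


def sum_of_squares_shortcut(values):
    """Sum of y squared, minus (sum of y) squared over n, as in (3.4.2)."""
    n = len(values)
    return sum(y * y for y in values) - sum(values) ** 2 / n


weights = [1, 4, 6, 1, 6, 2, 1]
incomes = [2400, 2750, 2180, 2320, 1930, 3650, 2180, 2000, 2190]

for data in (weights, incomes):
    print(sum_of_squares_direct(data), sum_of_squares_shortcut(data))
```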




An additional comment on computing is required for the case where the data 
are available to you only in the form of a frequency table. As an example, 
suppose that the following table is all the information you have about the 
observations in a sample of size 209.


Cholesterol Level 
(mg/100 ml) Frequency 


125-149 4 
150-174 13 
175-199 30 
200-224 42 
225-249 51 
250-274 34 
275-299 23 
300-324 8 
325-349 3 
350-374 1 

209 


As we pointed out when we discussed the compilation of such tables in 
Chapter 2, the individual identities of the observations have been lost. All we
know is the number of observations in each of the indicated intervals, and the 
best we can do is to approximate the real distribution by assuming the 
observations in any interval to be evenly distributed across that interval. So far 
as taking the sum of the observations is concerned, this assumption is equiva- 
lent to having all the observations in an interval measured as if at the midpoint 
of the interval: evenly distributed values would balance out “overs” and 
“unders” around that midpoint. Calculations of mean and standard deviation 
then proceed by adding in batches, as shown below. 


The interval midpoints are found by carefully setting up the exact numerical 
boundaries on the scale of intervals and then taking the mean of each interval’s 
beginning and ending points. 




                                              Interval       Contribution to
Interval         Cholesterol    Frequency     midpoint     $\sum y$     $\sum y^2$
limits           level          ($f$)         ($y$)        ($fy$)       ($fy^2$)

124.5-149.5      125-149          4           137             548        75,076
149.5-174.5      150-174         13           162           2,106       341,172
174.5-199.5      175-199         30           187           5,610     1,049,070
199.5-224.5      200-224         42           212           8,904     1,887,648
224.5-249.5      225-249         51           237          12,087     2,864,619
249.5-274.5      250-274         34           262           8,908     2,333,896
274.5-299.5      275-299         23           287           6,601     1,894,487
299.5-324.5      300-324          8           312           2,496       778,752
324.5-349.5      325-349          3           337           1,011       340,707
349.5-374.5      350-374          1           362             362       131,044

                                209                        48,633    11,696,471


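$$\bar{y} = \frac{\sum y}{n} = \frac{48,633}{209} = 232.69$$

$$\sum (y - \bar{y})^2 = \sum y^2 - n\bar{y}^2 = 11,696,471 - 209(232.69)^2$$
$$= 11,696,471 - 209(54,144.6361)$$
$$= 11,696,471 - 11,316,229 = 380,242$$

$$s^2 = \frac{380,242}{208} = 1828.1$$

$$s = \sqrt{1828.1} = 42.8$$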
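
The batch calculation can be sketched in Python as follows. The layout of `classes` and the variable names are our own. Because the program carries the mean to full precision instead of rounding it to 232.69 before squaring, its sum of squares comes out a little smaller than the hand figure above, and s is about 42.7 rather than 42.8. The difference is a rounding effect only.

```python
import math

# (lower limit, upper limit, frequency) for each cholesterol class, as printed.
classes = [
    (125, 149, 4), (150, 174, 13), (175, 199, 30), (200, 224, 42),
    (225, 249, 51), (250, 274, 34), (275, 299, 23), (300, 324, 8),
    (325, 349, 3), (350, 374, 1),
]

n = sum(f for _, _, f in classes)
# The midpoint of 124.5-149.5 equals the midpoint of the printed limits 125-149.
midpoints = [(lo + hi) / 2 for lo, hi, _ in classes]

sum_fy = sum(f * y for (_, _, f), y in zip(classes, midpoints))
sum_fy2 = sum(f * y * y for (_, _, f), y in zip(classes, midpoints))

mean = sum_fy / n
sum_of_squares = sum_fy2 - sum_fy ** 2 / n      # shortcut form of (3.4.2)
variance = sum_of_squares / (n - 1)
std_dev = math.sqrt(variance)

print(n, sum_fy, sum_fy2, round(mean, 2), round(std_dev, 1))
```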


To find the median in such a table, we look for a point on the y scale at 
which 50 percent of the observations have accumulated. The procedure is seen 
best in graphical form. For n = 209, the median is at a point where we have the 
hypothetical accumulation of .50 х 209 = 104.5 observations: 


(Frequency)        (4)       (13)      (30)      (42)      (51)
               |---------|---------|---------|---------|---------|
             124.5     149.5     174.5     199.5     224.5     249.5




We have 89 observations up to 224.5 and so need 104.5 - 89 = 15.5 observations
out of the next interval. To get that many out of the 51 observations in
the interval, we argue that the even distribution assumption tells us to go
(15.5/51) of the way through the interval, and in distance that is (15.5/51) of
the width of the interval. Thus


$$\text{Median} = 224.5 + \frac{15.5}{51}(25) = 224.5 + \frac{387.5}{51} = 224.5 + 7.6 = 232.1$$
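
The interpolation argument can be written as a small routine. This is a sketch under the same even-distribution assumption; the function name `grouped_median` and its arguments are ours.

```python
def grouped_median(boundaries, frequencies):
    """Median of grouped data by linear interpolation within the median class.

    `boundaries` holds the exact class boundaries (one more than the number
    of classes); `frequencies` holds the count in each class.
    """
    n = sum(frequencies)
    if n == 0:
        raise ValueError("the frequency table is empty")
    target = n / 2                      # hypothetical accumulation, e.g. 104.5
    cumulative = 0
    for i, f in enumerate(frequencies):
        if cumulative + f >= target:
            lower = boundaries[i]
            width = boundaries[i + 1] - boundaries[i]
            # Go (needed / f) of the way through this interval.
            return lower + (target - cumulative) / f * width
        cumulative += f


boundaries = [124.5, 149.5, 174.5, 199.5, 224.5, 249.5,
              274.5, 299.5, 324.5, 349.5, 374.5]
frequencies = [4, 13, 30, 42, 51, 34, 23, 8, 3, 1]

median = grouped_median(boundaries, frequencies)
print(round(median, 1))
```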


Find the range, the variance, the standard deviation, and the coefficient of 
variation in each of the following cases on which you worked in the preceding 
set of exercises: 


Exercise 3.4.1 “Misty” mouthwash data of Exercise 3.2.1. 
Exercise 3.4.2 Chemical reaction times of Exercise 3.2.2. 
Exercise 3.4.3 Detergent percentages in a soap product (Exercise 3.2.3). 


Exercise 3.4.4 Liquid fabric finish used in a normal load of wash (Exercise
3.2.5).


Exercise 3.4.5 Pizza shell weights in two different production lines (Exer- 
cise 3.2.9). 


Exercise 3.4.6 Monthly sales figures for a new product (Exercise 3.2.17). 
Exercise 3.4.7 Diastolic blood pressure measurements (Exercise 3.2.20). 
Exercise 3.4.8 Grouped data on height given in Exercise 3.2.21. 


Exercise 3.4.9 Survival data on experimental rats reported in Exercise
3.2.22.




3.5 SOME COMMENTS ON LOOKING SUMMARY
STATISTICS IN THE EYE 


While all the measures of centrality and variability that we have discussed 
are useful indicators of general tendency and the amount of variation around it, 
the user must be very careful in applying such measures to real data. DON'T 
FORGET TO LOOK AT THE RAW DATA. This may seem a simple 
requirement to state, but, unfortunately, there are many cases where it is not 
possible. For example, when the data set is very large (say n > 100), looking at 
the raw data tables is not very helpful. Here, however, utilizing some of the 
tabular and graphical techniques already discussed is very helpful and should 
always be done. In other cases, only secondary data sources are available, such 
as grouped frequency tables published in journals. The electronic computer is 
also a source of some irritation on this matter; in many computer programs the 
original data never see the light of day! The printout shows only an analysis of 
the data. 


Insist on seeing the actual data whenever you can. In the first place, the 
number of faulty data points discovered in many listings is usually quite 
surprising to the beginning data analyst. Error in recording a measurement, 
error in making the measurement, omission or duplication of a measurement, 
all of these can occur. They can sometimes be spotted by studying the raw data, 
sometimes not. Their existence is a constant hazard since all the statistics in the 
world will not correct these kinds of mistake. 


So far as summary statistics are concerned, we have seen that each statistic 
that can be used has its own peculiar applicability. The only way we can check 
up on this is to see the data and try to reach a conclusion concerning the 
validity of using the statistic. 


To further illustrate these points, let's consider the following real data on a 
botulism (a special kind of food poisoning) epidemic in La Plata, Argentina, in 
June 1957. The time from eating the tainted food until the onset of symptoms 
is called “the incubation period." The incubation periods of the first 21 cases 
are shown in the following table. (For the sake of simplicity, three numbers 
have been modified by one hour so that the arithmetic is easy; this does not 
change the conclusions.) 




Incubation Periods in Botulism Victims (hours)

 1.  14      8.  48     15.  36
 2.  19      9.  45     16.  43
 3.  20     10.  48     17.  29
 4.  20     11.  21     18.  19
 5.  32     12.  78     19.  43
 6.  34     13.  17     20.  85
 7.  36     14.  20     21.  91


These data were collected to get some idea about the characteristics of the 
incubation period of the disease in this particular outbreak. The following 
points should be determined: 

(a) average incubation period for botulism in this epidemic and (b) degree of 
variability of incubation period. 

The immediate response would be to calculate the arithmetic mean to 
determine point (a) and to calculate the standard deviation for point (b). In this 
case 


$\bar{y}$ = 38 hours
s = 22.4 hours (coefficient of variation = 58.9%)


Verify these calculations. 
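
One way to carry out the verification is sketched below with the Python standard library (variable names are ours). A small point to expect: the program computes the coefficient of variation from the unrounded s, which gives 59.0 percent, whereas 22.4/38 with s already rounded gives the 58.9 percent quoted above.

```python
import statistics

# Incubation periods (hours) for cases 1 through 21, in case order.
incubation_hours = [14, 19, 20, 20, 32, 34, 36, 48, 45, 48, 21,
                    78, 17, 20, 36, 43, 29, 19, 43, 85, 91]

mean = statistics.mean(incubation_hours)
std_dev = statistics.stdev(incubation_hours)     # divisor n - 1
cv_percent = std_dev / mean * 100

print(mean, round(std_dev, 1), round(cv_percent, 1))
```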


However, this is an inadequate answer. Notice that the standard deviation is 
almost 60 percent as large as the mean. Whenever the sample standard 
deviation is this large, one should examine the data carefully. Quite often one 
finds a few very large or very small observations. These should be investigated 
to make sure that they are valid and belong. Table 3.5.1 shows all of the 
individual contributions to the standard deviation. 




TABLE 3.5.1 Standard Deviation Composition


Case Number     $y$     $y - \bar{y}$     $(y - \bar{y})^2$

 1              14       -24                 576
 2              19       -19                 361
 3              20       -18                 324
 4              20       -18                 324
 5              32       - 6                  36
 6              34       - 4                  16
 7              36       - 2                   4
 8              48       +10                 100
 9              45       + 7                  49
10              48       +10                 100
11              21       -17                 289
12              78       +40               1,600
13              17       -21                 441
14              20       -18                 324
15              36       - 2                   4
16              43       + 5                  25
17              29       - 9                  81
18              19       -19                 361
19              43       + 5                  25
20              85       +47               2,209
21              91       +53               2,809

Totals         798         0 (check)      10,058

$$\bar{y} = \frac{798}{21} = 38; \qquad s = \sqrt{\frac{10,058}{20}} = \sqrt{502.9} = 22.4$$

[Median = $y_{(11)}$ = 34, the 11th of the 21 observations when arranged in order]


Notice that the incubation periods of three of the cases (12, 20, 21; namely
78, 85, and 91 hours) account for


$$\frac{1600 + 2209 + 2809}{10,058} \times 100\% = 65.8\%$$

of the total sum of squares used in calculating the standard deviation. These
three extremely long incubation periods are the major contributors to the large
standard deviation. Furthermore, since they are all on one side of the arithmetic
mean, we can also say the sample distribution is skewed (to the right, since the
distribution has a stretched-out right side). By plotting the data, using a few
intervals, say six, we can see this in Table 3.5.2.


TABLE 3.5.2 Distribution of Incubation Periods


Class Interval     Frequency

10-24                  8
25-39                  5
40-54                  5
55-69                  0
70-84                  1
85-99                  2

Total                 21


Frequency 


As a result of the histogram plot, we might go further and conjecture 
the shape of the incubation period distribution for a theoretical botulism 
epidemic to look like the smooth curve put through the data. While we cannot 
prove this, at least it is the possible foundation for a hypothesis about the shape 
of the incubation period for a typical botulism epidemic. 


Thus by carefully examining the individual data and the elements used in
calculating s, and then using plotting of grouped data, we have arrived at a
reasonable hypothesis or conjecture for the distribution of the incubation
period of botulism.
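The two checks made in this section, the share of the sum of squares due to the three longest incubation periods and the grouping into six class intervals of width 15, can be reproduced with a short Python sketch. The variable names and the starred text display are our own choices for illustration.

```python
# Incubation periods (hours) for the 21 cases, in case-number order (Table 3.5.1).
hours = [14, 19, 20, 20, 32, 34, 36, 48, 45, 48, 21,
         78, 17, 20, 36, 43, 29, 19, 43, 85, 91]

y_bar = sum(hours) / len(hours)
squares = [(y - y_bar) ** 2 for y in hours]     # individual contributions (y - mean)^2
total_ss = sum(squares)                         # total sum of squares

# Share of the total contributed by the three largest squared deviations.
largest_three = sorted(squares, reverse=True)[:3]
share = 100 * sum(largest_three) / total_ss

# Frequencies for the class intervals 10-24, 25-39, ..., 85-99.
width = 15
counts = {}
for low in range(10, 100, width):
    high = low + width - 1
    counts[(low, high)] = sum(1 for y in hours if low <= y <= high)

print(f"share of sum of squares from the three largest: {share:.1f}%")
for (low, high), n in counts.items():
    print(f"{low:2d}-{high:2d}  {n:2d}  {'*' * n}")
```

The row of stars printed for each interval is a crude sideways histogram; the long gap before the last two intervals is the right-hand stretchout described above.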




Another guideline useful for detecting the unusual behavior of some of the 
observations is to determine by how many sample standard-deviation units 
each observation deviates from the sample mean. For example, the botulism 
data show the following (Table 3.5.3): 


TABLE 3.5.3 Standardized Deviations of Incubation Periods


Incubation Period      Deviation from ȳ      Standardized deviation
   (Hours) y                y - ȳ                  (y - ȳ)/s

      14                     -24                     -1.07
      19                     -19                      -.85
      20                     -18                      -.80
      20                     -18                      -.80
      32                      -6                      -.27
      34                      -4                      -.18
      36                      -2                      -.09
      48                     +10                      +.45
      45                      +7                      +.31
      48                     +10                      +.45
      21                     -17                      -.76
      78                     +40                     +1.78
      17                     -21                      -.94
      20                     -18                      -.80
      36                      -2                      -.09
      43                      +5                      +.22
      29                      -9                      -.40
      19                     -19                      -.85
      43                      +5                      +.22
      85                     +47                     +2.10
      91                     +53                     +2.36


As you can see, 17 of the 21 observations are within one standard deviation
of the sample mean. Furthermore, three of the four that deviate by more than
one standard deviation are on the positive side. This, plus the fact that 13 of
the 21 are small negative deviations, leads to the conjecture that the data are
skewed to the right; that is, they have an asymmetrical sample distribution with
right-hand stretchout.
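The standardized deviations and the counts quoted above can be checked with the following Python sketch. It assumes the same 21 incubation periods as Table 3.5.1; the names z, within_one, and beyond_one are ours.

```python
from statistics import mean, stdev

# Incubation periods (hours) for the 21 cases, in case-number order (Table 3.5.1).
hours = [14, 19, 20, 20, 32, 34, 36, 48, 45, 48, 21,
         78, 17, 20, 36, 43, 29, 19, 43, 85, 91]

y_bar = mean(hours)
s = stdev(hours)                                   # divisor n - 1

# Standardized deviation of each observation: (y - mean) / s.
z = [(y - y_bar) / s for y in hours]

within_one = sum(1 for v in z if abs(v) <= 1)      # observations within one s of the mean
beyond_one = [(y, round(v, 2)) for y, v in zip(hours, z) if abs(v) > 1]
negative = sum(1 for v in z if v < 0)              # observations below the mean

print("within one standard deviation:", within_one)
print("beyond one standard deviation:", beyond_one)
print("negative deviations:", negative)
```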




SUMMARY EXERCISES 


3.5.1 The following table gives data on per capita income in 134 counties reported in a 
study of food programs in social service. 


Per Capita Income ($) in 134 Counties Having 
Food Distribution or Food-Stamp Programs 


2400 1820 3280 2550 1770 2850 
2210 5400 1870 2660 1800 1200 
2150 2175 2300 1710 1600 2620 
2750 2780 1680 2320 2310 2000 
1975 1800 1835 700 3450 1760 
2650 3300 1400 478 2210 1695 
3570 1625 530 1830 2460 2350 
2020 1910 1720 3150 1830 1625 


2710 2200 2010 1840 810 1900 
2180 1800 1920 1815 2510 520 
1970 1750 1675 1250 1930 2610 
3850 2700 800 1950 1550 2480 


1730 2220 1675 1650 400 2330 
1760 2540 1695 2100 1810 1575 

515 1770 2760 1710 750 1860 
3350 500 3050 3200 3250 3900 
1850 2870 600 1750 2470 1990 


a. Describe the information contained in these data by constructing a
frequency table (take intervals 250-749.99, 750-1249.99, 1250-1749.99, etc.,
and round the midpoints to 500, 1000, 1500, etc.). Using this table, draw a
frequency histogram and a frequency polygon.


b. Calculate the following statistics: (i) range, (ii) mean, (iii) median, (iv)
standard deviation, and (v) coefficient of variation.




3.5.2 To get some practice in dealing with data that relate to a variety of
characteristics in a single study, let us consider again the survey of upper-class
students in Mecca Community College (Table 1.2.1). To keep down computational
labor, and to see how different samples behave, separate the 180 observations
into four samples:


Sample Number     Student Numbers

      1               1-45
      2               46-90
      3               91-135
      4               136-180


Each of the four samples has size n = 45. We shall assume that the method of 
putting together the survey of 180 students allows us to consider each of the 
samples of size 45 to be a valid random sample for representing the upper-class 
students. 


By instructor’s edict, town-hall meeting of the class, drawing numbers out of a 
hat, or any other nonviolent procedure, distribute the four samples around and 
around, one to each student. Let each student then work on his or her sample, 
using the data to give answers to the following questions. Extra credit is 
available for coming up with additional features that can be wrung out of the 
data. 


a. What percentage of the sample is: (i) male and (ii) female? 


b. What is the “average” commuting distance of the students in the sample? 
(Look at mean and median, distinguish between them.) What is the standard 
deviation of the commuting distance? The coefficient of variation? What 
does the distribution of commuting distance look like? Is there a difference 
in commuting distance between that for females and that for males? 


c. What can we say about political party preference among the students in the
sample? Does it differ with sex? Does it change depending on how far one
commutes?


d. How do these students vote on the “legalizing of marijuana" question? 
What percentage of the students agree to legalization? How does this 
percentage differ: (i) between the two sexes and (ii) among the four 
categories of political party preference? 

e. What does the sample distribution of the G.P.A.'s look like? What are its
mean and standard deviation? How does the G.P.A. differ between the
sexes, if at all? Is there a relation between G.P.A. and commuting distance?


f. If you were to use the answers to the above questions on the sample of 45 to
infer conclusions about the sample of 180, on which of your answers would
you feel you couldn't be far wrong?


Statistics and Chance


4.1 DESCRIPTION AND INFERENCE IN STATISTICS


Up to now we have been considering orderly procedures for describing and
summarizing a collection of observed numerical data. In most cases of practical
interest, such a collection of data is only a small fraction of all possible
observations that could in principle be made. It is a sample from a universe, or
population, of outcomes in the particular process under study.

The data on community-college upper-class students in Chapter 1 came from one
sample of upper-class students in one specific year in one specific college
system. A different sample would yield somewhat different data for the
upper-class population of that system in that year. A different year would give a
different set of data from all possible sets of data over time in the specific
system. A different system would yield a different collection of data on the
population of community-college upper-class students in a wider region.

The data in Exercise 2.3.3 for the source of funds in the XYZ Health Department
in 1960 and 1965 are specific for that health department in the stated years.
Different health departments and/or different years would give different data on
funding sources in health departments.

The age distribution reported on page 44 for patients treated for carcinoma of
the cervix in the Barnard Free Skin and Cancer Hospital, St. Louis, during the
period 1906-1926 is precise for that specific hospital during that specific
span of years. Observation of ages of such patients will give different data for
other possible hospitals and other observable years.


[PEANUTS cartoon: "You said there was a billion-to-one chance that we might win... But we didn't! We lost! Boy, you just can't believe anyone any more!"]


While the descriptive statistics that we have been studying are extremely 
valuable for assessing what we have observed, it is customary for the data to be 
given the additional obligation of telling us something dependable about the 
universe from which our observations were drawn as a sample. What can we 
say about upper-class students at Mecca Community College in the next 5 
years? What can we say about upper-class students in American community 
colleges generally? What can be said about Health Department funding across 
the state of North Carolina, now and in 1980 and 1985? To what extent do the 
Barnard Hospital data represent the age distribution of all Americans now 
having carcinoma of the cervix? 


In the majority of cases, the purpose of collecting data in the first place is to
form the basis of making a generalization to a much larger universe (the total
population in a country, the underlying biological process, the ongoing
production line, or the actual natural law) from which the observations are taken
as a sample. This is the field of statistical inference; from a particular sample we
want to be able to infer one or another characteristic of the universe that
produced the sample. The report in Exercise 2.3.2 concerning the employment
of persons 14 years of age and over as of March 1968 came from such an
inference. The Bureau of the Census did not count and describe every
employed person in the United States in March 1968; it used the data of a
sample scientifically chosen by the Bureau of Labor Statistics and generalized
in a dependable manner to the entire country.


The logic that we use in statistical inference is inductive logic, arguing from
particular cases to the general law. Such inductive argument has two main
aspects: (a) estimation of critical characteristics of the universe and (b) test of a
hypothesis concerning the universe. The latter aspect is a fundamental feature
of the time-honored "scientific method": form a hypothesis about a state of
nature, make observations on that state, compare observations with hypothesis,
and accept or reject the hypothesis according as the observations are or are not
consistent with the hypothesis.


Estimation is also an essential feature of scientific investigation, for it is on
the basis of observations that the scientist arrives at values of critical physical
constants whenever they cannot be found by purely theoretical logic. Until
more advanced mathematical methods were applied, the value of π was an
estimate, for example, 3 in the Bible, (16/9)² in ancient Egyptian writings.
While the so-called absolute constants, like π, can eventually be determined
mathematically to within any desired degree of accuracy, there remains the vast
variety of processes in nature wherein the constants of importance can be
approached only through observation: the growth rate of a bacterium species,
the constants defining the accident rate on a particular superhighway, the
proportion of recoveries from a given disease after use of a given drug, and so
on and on.


The essential features of a situation that requires statistical inference are: (a) 
variability in the population—the circumstance that not all elements of the 
universe exhibit the same value for the characteristic under study and (b) 
sampling—the circumstance that our observations constitute only a fraction of 
all possible observations in the universe. 


Variability in the population causes it to exhibit what is called random 
behavior. Different members of the population have different values of the 
characteristic under study, and we cannot know ahead of time the exact value 
that will be shown by a member chosen "at random" from the population. 


When we have chosen "at random" a number of members of the population,
thus forming a sample, and have observed their values, we have seen the
outcome of one experiment on the population's random behavior. We know
that this cannot give us the precise truth about the entire population. What we
should reasonably want, therefore, is an inductive procedure that will enable us
to have a definable level of confidence that the truth is within specifiable
practical limits, or that a decision we make about a hypothesis will be correct.


Terms like "random," "at random," and "level of confidence" carry certain
intuitive meanings, but they must be made precise in some quantitative
structure that we can apply mathematically. Notions of chance occurrences, the
likelihood of an event, and betting odds have been in man's thinking (and
acting!) from the earliest times. Quantifying the notions in a coherent
framework dates from the middle of the seventeenth century. Since that time
mathematicians and philosophers have continued the building and refinement
of the structure. The mathematical subject matter is called probability (or
mathematical probability or probability theory). In this chapter we shall try to
give a grasp of those elements of the subject most needed for statistical
inference.


4.2 DEFINITION OF PROBABILITY


DEFINITION 4.2.1 At least intuitively, if we wish to assign a nu- 
merical value to the likelihood of a chance event, we would use as our 
number the proportion of times the event occurs in an enormously long 
sequence of opportunities for the event to happen. 


We think of 1/2 as a reasonable measure of the likelihood of getting “head” 
when we toss a coin because we believe that a balanced coin should turn up 
“head” one half of the time if the coin is tossed a very great number of times. 
If we roll a balanced six-sided die whose faces are marked with the customary 
number of dots (1, 2, 3, 4, 5, 6), we would consider 1/6 as a reasonable 
measure of the likelihood of having the die show four dots on its top face after 
coming to rest, because we believe that in a very long sequence of rolls the face 
showing “4” should end up in top position one-sixth of the time. 


On such basis we could agree to start the quantifying of likelihood by 
deciding that the numerical measure of the likelihood of an event shall be a 
number between 0 and 1. This has been the established convention from the 
earliest days of the mathematical development of the subject. The next basic 
convention is to give the name probability to the numerical measure of 
likelihood. 




Thus we say that the probability of tossing a head is 1/2, the probability 
of rolling a “4” with a single die is 1/6; and we use the notation 


P(head) = 1/2, P(four) = 1/6. 


If there is any danger of confusion as to the chance situation being discussed, 
we can insert into the parenthesized expression a statement of the underlying 
condition, using a vertical bar to set it off: 


P(head | toss of a balanced coin) = 1/2, 
P(four | roll of a balanced die) = 1/6. 


The vertical bar is often read “given,” so that we read the above statements as 
“the probability of head, given the toss of a balanced coin, is 1/2," "the 
probability of four, given the roll of a balanced die, is 1/6." We regularly omit 
such statements of condition when the chance situation has been clearly 
identified at the beginning of the discussion into which the probability enters. 


The foundations of a mathematical theory of probability were laid by the
great philosopher-mathematician Blaise Pascal (1623-1662) and the eminent
mathematician Pierre de Fermat (1601-1665), in an exchange of correspondence
initiated by questions to Pascal by a French nobleman who found
contradictions between standard gambling rules and his own experience. From
this early work came the original (now called the classical) definition of
probability:


DEFINITION 4.2.2 If in a single trial of a chance situation there are
t different possible fundamental outcomes that are exhaustive, mutually
exclusive, and equally likely, and if f of these outcomes are favorable to an
event A, then the mathematical probability of A is defined as the ratio f/t.


Let us illustrate the classical definition of probability with five examples that
help us to understand words and concepts like exhaustive, mutually exclusive,
and equally likely in the definition.


Example 4.2.1. Toss of a balanced penny. 


Here there are two possible outcomes, head or tail. These exhaust the
possible outcomes. The two outcomes are mutually exclusive; that is, if a head
occurs, a tail cannot, or if a tail occurs, a head cannot. Finally, since the penny is
balanced, there is an equal chance of getting a head or a tail in the toss of
the penny. Hence t = 2 in the above definition.


Example 4.2.2. Toss of a balanced coin three times. 


Here the fundamental outcomes can be enumerated as the sequences HHH,
HHT, HTH, HTT, THH, THT, TTH, and TTT. Thus t = 8. If A is the event
two heads in the three tosses, then the fundamental outcomes favorable to A
are HHT, HTH, and THH. Hence f = 3 and P(A) = 3/8.


Example 4.2.3. Roll of a balanced die. 

Here the exhaustive, mutually exclusive, and equally likely fundamental 
outcomes are the six different numbers of dots that can appear face up after the 
die comes to rest: 1, 2, 3, 4, 5, 6. If A is the event four, then only the outcome
4 is favorable, and so P(A) = 1/6. If B is the event a number divisible by 3, then
the favorable outcomes are 3 and 6, whence f = 2 and P(B) = 2/6 = 1/3.


Example 4.2.4. Roll of a pair of balanced dice. 


Here we can identify a fundamental outcome by giving a pair of numbers, 
the first referring to Die No. 1 and the second referring to Die No. 2. Thus 
there are 36 fundamental outcomes, shown as follows. 


Fundamental Outcomes


11   21   31   41   51   61
12   22   32   42   52   62
13   23   33   43   53   63
14   24   34   44   54   64
15   25   35   45   55   65
16   26   36   46   56   66


In this example t = 36. Suppose that we are interested in the total number of
dots showing on the top faces of the two dice when they come to rest after
being tossed. In dice games this total is usually called the point that is rolled. If
A is the event point 7, then the fundamental outcomes favorable to A are
(1, 6), (2, 5), (3, 4), (4, 3), (5, 2), and (6, 1). Thus f = 6, and P(point 7) = 6/36 =
1/6.
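The counting in Examples 4.2.2 through 4.2.4 can be mechanized. The Python sketch below enumerates the fundamental outcomes and applies the ratio f/t of Definition 4.2.2; the helper name classical_probability is our own invention, and the sketch is valid only under the definition's assumption that the listed outcomes are exhaustive, mutually exclusive, and equally likely.

```python
from fractions import Fraction
from itertools import product

def classical_probability(outcomes, is_favorable):
    """Return f/t: favorable outcomes over all equally likely fundamental outcomes."""
    t = len(outcomes)
    f = sum(1 for o in outcomes if is_favorable(o))
    return Fraction(f, t)

# Example 4.2.2: three tosses of a balanced coin (t = 8).
coin3 = list(product("HT", repeat=3))
p_two_heads = classical_probability(coin3, lambda o: o.count("H") == 2)

# Example 4.2.3: one roll of a balanced die (t = 6).
die = list(range(1, 7))
p_four = classical_probability(die, lambda o: o == 4)
p_divisible_by_3 = classical_probability(die, lambda o: o % 3 == 0)

# Example 4.2.4: roll of a pair of balanced dice (t = 36); event "point 7".
dice = list(product(range(1, 7), repeat=2))
p_point_7 = classical_probability(dice, lambda o: o[0] + o[1] == 7)

print(p_two_heads, p_four, p_divisible_by_3, p_point_7)
```

Exact fractions are used rather than decimals so that the results can be compared directly with the ratios in the text.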


Example 4.2.5. Drawing an upper-class student at random from the Mecca 
Community College sample. 

To draw a unit “at random” from a collection of units means to choose a 
unit by some process whereby every unit in the collection has equal chance of 
being drawn. One elementary way of doing this would be to put into a box an 
appropriately labeled tag for each unit in the collection, shake the box 
vigorously so as to mix the tags well, and then draw one tag from the box. In 
such a random drawing, the individual units in the collection are the set of t 
exhaustive, mutually exclusive, and equally likely outcomes referred to in the 
classical definition of probability. 


In the present example we thus have t = 180. Suppose we ask for the
probability that a student drawn at random will have no party preference
politically. We have only to count the number of Ns in the data set, find it to
be 64, so that f = 64, and then argue:


P(no party preference) = 64/180 = 0.356.


In calculating probabilities by the classical definition, the whole problem 
resides in making an accurate count of the total number of possible outcomes 
and the number favorable to the event under consideration. This can become 
very complicated as soon as we move away from straightforward cases. Making 
such counts comes under the heading of “combinatorial” mathematics, a 
subject beyond our needs for this book. Also, most practical problems do not 
have such nice cleanly defined outcomes that are equally likely. Thus the 
classical definition of probability had to be generalized in two directions: (a) to 
allow the fundamental outcomes to differ as to likelihood and (b) to cover cases 
where the possible outcomes are too numerous to count. Before taking up this 
aspect of probability, let's consider the practical implications of a well-defined 
classical case. 


4.3 THE PRACTICAL MEANING OF PROBABILITY


Let us return to the matter of interpretation with which we began. The
probability P(A) of an event is the long-run proportion of times that the event
A occurs in a sequence of trials of the experiment. The proportion of A
occurrences in any finite number n of trials keeps changing as n changes; P(A)
is the limiting value of these proportions as n increases without bound, going
on to infinity.


For example, consider the experiment of rolling a pair of dice and getting a
total of 7. We have seen that the probability of getting a total of 7 is 1/6
(≈ 0.167). In two different actual experiments of rolling a pair of dice 504
times, the first 18 rolls yielded the cumulative records shown in Table 4.3.1.
Notice the wide oscillation of the proportion of 7s rolled in these early stages
of the sequence of trials. The cumulative records for trials 15 through 504 are
shown graphically in the diagrams of Figure 4.3.1 on the next page.

TABLE 4.3.1


             Experiment 1                          Experiment 2

Number    Cumulative    Cumulative      Number    Cumulative    Cumulative
of        number        proportion      of        number        proportion
rolls     of 7s         of 7s           rolls     of 7s         of 7s

   1         1*           1.000            1         0            .000
   2         1             .500            2         0            .000
   3         1             .333            3         0            .000
   4         1             .250            4         0            .000
   5         1             .200            5         0            .000
   6         1             .167            6         1            .167
   7         1             .143            7         1            .143
   8         1             .125            8         2            .250
   9         1             .111            9         2            .222
  10         1             .100           10         2            .200
  11         1             .091           11         2            .182
  12         1             .083           12         2            .167
  13         2             .154           13         3            .231
  14         2             .143           14         3            .214
  15         3*            .200           15         3            .200
  16         3             .188           16         3            .188
  17         4             .235           17         3            .176
  18         4             .222           18         3            .167


* On the first roll, a 7 was obtained, and the second 7 was gotten on the 13th roll, thus making a cumulative total
of two rolls each getting a 7; the third 7 was obtained on the 15th roll, and so on.
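A cumulative record like the ones in Table 4.3.1 can be imitated on a computer. The following Python sketch is only an illustration under our own assumptions: the function name cumulative_proportions and the seed value are hypothetical, and a pseudo-random generator stands in for physical dice, so its numbers will not match the experiments reported here.

```python
import random

def cumulative_proportions(n_rolls, seed):
    """Roll a simulated pair of dice n_rolls times; return the running proportion of 7s."""
    rng = random.Random(seed)          # seeded so the run can be repeated exactly
    sevens = 0
    record = []
    for n in range(1, n_rolls + 1):
        if rng.randint(1, 6) + rng.randint(1, 6) == 7:
            sevens += 1
        record.append(sevens / n)      # cumulative proportion after n rolls
    return record

record = cumulative_proportions(20000, seed=1976)
for n in (6, 18, 504, 5000, 20000):
    print(f"{n:6d} rolls: proportion of 7s = {record[n - 1]:.4f}")
```

Changing the seed gives a different sequence, much as Experiment 1 and Experiment 2 differ from each other in their early stages.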


[Figure: two panels, Experiment 1 and Experiment 2; horizontal axis: Number of rolls (n), from 10 to 500]


FIGURE 4.3.1 Proportion of n pair-of-dice rolls giving point 7.


In each of the two experimental sequences there is evidence that the
proportion of 7s is making some effort to settle down toward 1/6. However, 504
rolls are not nearly a long enough "long run" to bring the proportion
consistently very close to 1/6. We can calculate later in the chapter that
something over 5000 rolls are required to give 95 percent probability that the
observed proportion will be within .01 of 1/6. To raise that probability to 99.99
percent requires 21,000 rolls. Three such sequences performed by an eager
electronic computer had representative stages in their history as shown in
Table 4.3.2. The table indicates that it takes a very large number of rolls
of a pair of dice before the observed proportion stays close to the
probability 1/6.
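The two roll counts quoted above can be approximated ahead of the formal development. The Python sketch below assumes the usual normal approximation for a sample proportion, n = z²p(1 - p)/d², where d is the allowed distance from p and z is the standard normal point for the chosen probability. That formula is our assumption about how such figures are obtained, not a statement of the method used later in the chapter, and the function name rolls_needed is hypothetical.

```python
from math import ceil
from statistics import NormalDist

def rolls_needed(p, half_width, probability):
    """Approximate number of trials so that the observed proportion falls within
    half_width of p with the given probability (normal approximation)."""
    z = NormalDist().inv_cdf(0.5 + probability / 2)   # two-sided standard normal point
    return ceil(z * z * p * (1 - p) / half_width ** 2)

p7 = 1 / 6
n_95 = rolls_needed(p7, 0.01, 0.95)
n_9999 = rolls_needed(p7, 0.01, 0.9999)
print(n_95, n_9999)
```

Under this approximation the first count comes out somewhat above 5000 and the second close to 21,000, in line with the figures in the text.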


TABLE 4.3.2


Number         Cumulative Proportion of 7s
of
rolls      Sequence 1    Sequence 2    Sequence 3

   100       .1700         .1900         .1400
   200       .1900         .1500         .1450
   300       .1800         .1367         .1500
   400       .1775         .1300         .1500
   500       .1720         .1320         .1500
   600       .1667         .1250         .1533
   700       .1629         .1286         .1514
   800       .1675         .1275         .1513
   900       .1689         .1267         .1467
  1000       .1700         .1370         .1450
  1500       .1633         .1453         .1453
  2000       .1620         .1600         .1485
  2500       .1660         .1568         .1496
  3000       .1633         .1620         .1517
  3500       .1614         .1669         .1523
  4000       .1628         .1675         .1565
  4500       .1676         .1698         .1549
  5000       .1660         .1692         .1576
  6000       .1677         .1647         .1575
  7000       .1650         .1647         .1583
  8000       .1674         .1646         .1599
  9000       .1676         .1660         .1599
 10000       .1664         .1658         .1613
 12500       .1637         .1680         .1639
 15000       .1629         .1691         .1655
 17500       .1622         .1694         .1655
 20000       .1628         .1681         .1647


In other practical situations the argument deals with the prevalence of a
certain characteristic in a finite population of elements: lung cancer in a
population of male smokers, individual weight between 1 and 2 pounds in a
school of fish, tree girth between 6 and 12 inches in a forest, favorable
opinion about a referendum question in a community of voters. If we designate
by A the characteristic of interest, then P(A) is the measure of likelihood that
A will be exhibited by a single individual member drawn at random from the
population. With respect to the entire population, P(A) has the important
related meaning as the proportion of the population that is composed of
A-type elements. Thus P(lung cancer) = .05 indicates that 5 percent of the
related population has lung cancer; P(fish weight between 1 and 2 pounds) =
.70 indicates that 70 percent of the fish in the associated population are of
weight 1-2 pounds each. In a matter of opinion in a population of 5000 voters,
P(favorable opinion) = 3/10 can tell us that 30 percent of the voters in that
population hold a favorable opinion.


The above interpretations of the probability of an event, in terms of the limit
of the proportion of times the event occurs in an unending sequence of trials,
or the proportion of an entire population exhibiting the characteristic which is
specified by the event, give practical meaning for long-run or overall
experience. But what of the practical meaning for a single trial? For example, we
know that the probability of rolling point 7 in a single fair roll of a pair of dice
is 1/6, but when that roll is actually performed the result will be either a 7
or not a 7. All that probability tells us is that the latter outcome is five
times as likely as the former, since P(7) = 1/6 while P(not a 7) = 1 - (1/6) =
5/6.*


The gambler uses probability to set fair "odds" for a single trial. He argues,
for example, that since not-7 is five times as likely as 7, the person who bets
$1 that he will roll a 7 should receive $5 if he actually rolls it, and the
statement is made that "fair odds are 5-to-1 (or 5:1)." These odds are
considered "fair" on the basis of equating "expected" gain with "expected"
loss, expectation being the result of weighting the gain (or loss) by the
probability of attaining it. If the roll is made at 5:1 odds, the expected gain is
(1/6)($5) = $(5/6), and the expected loss is (5/6)($1) = $(5/6). Such expectations
are again "long-run" concepts since they involve probabilities. Thus the logic
of setting such fair odds is again a "long-run" argument: in the long run the
gambler will have gains to balance losses.


DEFINITION 4.3.1 In general the mathematical odds in favor of A
(against not-A) are a:b, where a and b are integers satisfying


    a/b = P(A) / [1 - P(A)].


* It must be a general rule that for any event A, P(A) + P(not-A) = 1, since the events A and
not-A exhaust all possibilities, thus accounting for the totality of probability while at the same
time having no overlap of outcomes common to them both.


Thus the mathematical odds in favor of rolling 7 are given by


    P(7) / [1 - P(7)] = (1/6)/(5/6) = 1/5,


so that we say the mathematical odds are 1:5 in favor of rolling 7. In cases
where a < b it is often preferred to state the odds as b:a against A. Thus the
odds for rolling 7 can be stated as 5:1 against rolling 7.


Gambling odds are always taken as b:a since these odds are intended to
balance out expectations based on mathematical odds, and

    b/a = [1 - P(A)] / P(A).

Thus in summary we match reality to the formal definition of mathematical
probability by interpreting P(A), the probability of event A, as: (a) the
long-run proportion of times that A occurs in an unending sequence of trials,
(b) the proportion of the population that is composed of A-type elements,
(c) the single-trial likelihood of the occurrence of A, expressible as
P(A):[1 - P(A)] odds in favor of the occurrence of A.
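The passage from a probability to odds, and the balancing of expected gain against expected loss, can be sketched in Python as follows. The function name odds_in_favor is hypothetical; exact fractions are used so that the odds come out as a pair of integers in lowest terms, and the sketch assumes 0 < P(A) < 1.

```python
from fractions import Fraction

def odds_in_favor(p):
    """Return integers (a, b) in lowest terms with a/b = p / (1 - p), for 0 < p < 1."""
    ratio = Fraction(p) / (1 - Fraction(p))
    return ratio.numerator, ratio.denominator

p7 = Fraction(1, 6)                 # P(point 7) with a pair of balanced dice
a, b = odds_in_favor(p7)            # mathematical odds a:b in favor of a 7

# Fair gambling odds are b:a. A bettor staking $a receives $b if the 7 is rolled.
expected_gain = p7 * b              # probability of winning times amount won
expected_loss = (1 - p7) * a        # probability of losing times amount staked

print(f"odds in favor of 7: {a}:{b}; fair gambling odds: {b}:{a}")
print(f"expected gain = {expected_gain}, expected loss = {expected_loss}")
```

With P(7) = 1/6 the two expectations coincide at 5/6 of a dollar, which is the sense in which 5:1 odds are called fair.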


EXERCISES 


4.3.1 In Example 4.2.2 we calculated the probability of getting exactly two heads in
three tosses of a balanced coin. We found P(2 heads | 3 coin tosses) = 3/8.
Calculate the probabilities of the other possible events; that is, find P(0 heads),
P(1 head), P(3 heads).


4.3.2 Make the probability calculations for all possible numbers of heads in four
tosses of a balanced coin.


4.3.3 In Example 4.2.4 we considered the rolling of a pair of fair dice and calculated 
as 1/6 the probability of rolling “point” 7. Calculate the probability of each 
possible “point” that can be rolled. That is, find P(point 2), P(point 
3),..., P(point 11), P(point 12). 


4.3.4 A deck of bridge playing cards contains 52 cards, composed of 13
"denominations" of each of four "suits." The suits are clubs, diamonds, hearts, and spades.
Clubs and spades are black in color, and diamonds and hearts are red. The
denominations are 2, 3, 4, 5, 6, 7, 8, 9, 10, jack, queen, king, and ace. Such a
deck is well shuffled and a card is drawn at random. What is the probability that
the card will be: (a) a heart, (b) red card, (c) ace, (d) face card (jack, queen, or
king), (e) card with denomination under 10, (f) one-eyed jack (jack of
hearts and jack of spades are pictured in profile, showing one eye)?


4.3.5 If a student is drawn at random from the Mecca Community College sample,
what is the probability that the student: (a) is a female, (b) has preference for
the Republican party, (c) has no opinion about legalizing marijuana, (d)
commutes more than 20 miles?


4.3.6 If the student drawn at random in Exercise 4.3.5 is seen to be a female, then
what are the probabilities in (b), (c), and (d)?


4.3.7 What is the probability that a randomly chosen Mecca College sample male will
disagree on legalizing marijuana? What is the corresponding probability for a
female?


4.3.8 What is the probability that a sample student randomly chosen from those
commuting less than 5 miles will have Democrat political preference? What is
the corresponding probability for a student commuting more than 20 miles?


4.3.9 A cage of inoculated laboratory mice contains four males and four females. An
adjoining cage of noninoculated mice has 10 males and six females. By accident
the barrier between the cages is released, and in the morning the experimenter
finds the mice all mixed up in a single cage. If one mouse is selected at random,
what is the probability that it is: (a) inoculated, (b) male, (c) inoculated, given
that it is male?


4.3.10 Eighteen students went on an all-day hike. Six got sunburned, five got bitten by
chiggers, nine made it without either of those misfortunes. What is the probability
that: (a) a sunburned hiker escaped the chiggers and (b) a bitten hiker was also
sunburned? (Hint: use logic on the counts to identify how many hikers were
both bitten and burned.)


4.3.11 In one of the manufacturing plants of a phonograph record company, machines
A, B, and C turn out pressings, machine A producing 30 percent of the total
output, machine B 50 percent, and machine C 20 percent. Each machine has a
certain fraction defective in its output: A, 3 percent; B, 2 percent; C, 4 percent.
Out of the joint production a record is drawn at random and found to be
defective. What are the probabilities that it was pressed by machine A, B, or C,
respectively? (Hint: taking total production as the general amount T will
enable you to make all of the necessary calculations.)


4.4 INDEPENDENT EVENTS


Of great importance in the analysis of chance situations is the notion of
independence in the probability sense. An intuitive idea is that events A and B
are independent (in the probability sense) if the occurrence of one of them has
no effect on the probability of the occurrence of the other. We apply such an
idea naturally when A and B are events in successive tosses of a coin or in
successive rolls of a pair of dice. It is the basis of the gambler's rule that "a coin
has neither memory nor conscience."


The development of the mathematical definition of independence usually
proceeds through consideration of conditional probability, the probability of
an event conditional on the occurrence of another event. We do not wish to go
into this subject in any detail, and so ask the reader to accept a definition of
independence and settle for some numerical examples of the implications
respecting conditional probability.


The joint occurrence of events A and B is called the compound (or 
intersection) event AB. Some examples are: head on two successive tosses of a 
coin, “4” and then “7” in two successive rolls of a pair of dice, male and then 
female in two random selections of students from the community-college 
sample, male and Democratic party preference in a random selection of one
student from the college sample, sunburned and bitten by chiggers in Exercise 
4.3.10. 


DEFINITION 4.4.1 The events A and B are independent if and only if
P(AB) = P(A) · P(B).


It is because of such independence that 


P(HH | coin tosses) = P(H) · P(H) = (1/2)(1/2) = 1/4.




Tallying the Mecca College sample data leads to the following table of the 
numbers of students distributed as to sex and political party preference: 


                      Political party preference
              Democrat   Republican   Other   No pref.
                 (D)         (R)       (O)      (N)      Total
Female (F)       27           …         …        …         76
Male (M)         33           …         …        …        104


In the data of the community-college sample above, there are 104 males and 76 
females. Hence for a single random drawing 


P(M) = 104/180 = 26/45,     P(F) = 76/180 = 19/45.


If we make two random selections of students by drawing a name, noting it, 
replacing it in the collection, then making a second random selection, we have 
an example of independent events, since 


P[(M, F) | drawing with replacement] = (104/180)(76/180) = P(M) · P(F).

But if we do not replace the first draw before making the second selection, then
only 179 names are left for the second draw, 76 of them female, and the joint
probability is


P[(male, female) | drawing without replacement] = (104/180)(76/179),


and this is not P(male) · P(female). Hence such draws are not independent.
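
The comparison between the two drawing schemes can be checked with exact fractions. The following is a minimal sketch; the variable names are illustrative, and only the counts 104, 76, and 180 come from the text.

```python
from fractions import Fraction

males, females = 104, 76          # counts quoted in the text
total = males + females           # 180 students

p_m = Fraction(males, total)      # P(M) = 26/45
p_f = Fraction(females, total)    # P(F) = 19/45

# Male first, then female, with the first name replaced before the second draw.
with_replacement = Fraction(males, total) * Fraction(females, total)

# Same event when the first name is kept out: only 179 names remain.
without_replacement = Fraction(males, total) * Fraction(females, total - 1)

print(with_replacement == p_m * p_f)      # True: the draws are independent
print(without_replacement == p_m * p_f)   # False: the draws are not independent
```

Exact fractions are used so that the test of equality is not blurred by rounding.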




The event “male and Democratic party preference" is the outcome of the 
random drawing of one student provided that student is one of the 33 students 
shown in the corresponding cell of the table. Hence 


P(MD) = 33/180 = 11/60.

But

P(M) = 104/180 = 26/45,     P(D) = 60/180 = 1/3,

so that

P(M) · P(D) = (26/45)(1/3) = 26/135.


Thus P(MD) ≠ P(M) · P(D), so that male and Democrat are not independent.


In fact, we can say that sex and political party preference are not independent, 
since independence of those characteristics would require that the multiplica- 
tion equality hold for all eight sex-party combinations, whereas we have 
already seen the equality fail for the Male-Democrat combination. 
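
The failure of the multiplication equality for the Male-Democrat cell can be verified as follows. This is a small sketch that assumes 33 male Democrats, 104 males, and 60 Democrats among the 180 students (the last count corresponds to P(D) = 1/3); the variable names are illustrative.

```python
from fractions import Fraction

total = 180
p_md = Fraction(33, total)    # male and Democrat: 33 students
p_m = Fraction(104, total)    # male: 104 students
p_d = Fraction(60, total)     # Democrat: 60 students assumed, so P(D) = 1/3

print(p_md, p_m * p_d)        # 11/60 26/135
print(p_md == p_m * p_d)      # False: male and Democrat are not independent
```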


More than two events are considered independent if multiplication equalities 
hold for all compound events formed of any two of them, any three of them, 
any four of them, and so on. 


DEFINITION 4.4.2 The events A1, ..., Ak are independent events
if and only if the probability of the joint occurrence of any collection 
of the individual events equals the product of the probabilities of the 
individual events in the collection. 
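
Why the definition asks for every collection, and not just every pair, can be seen in a standard illustration that is not taken from the text: two tosses of a fair coin with A = head on the first toss, B = head on the second toss, and C = the two tosses agree. The sketch below (helper names are hypothetical) shows that each pair satisfies the multiplication equality while the triple does not.

```python
from fractions import Fraction
from itertools import product

outcomes = list(product("HT", repeat=2))   # four equally likely outcomes

def prob(event):
    """Probability of an event given as a true/false test on an outcome."""
    return Fraction(sum(1 for o in outcomes if event(o)), len(outcomes))

def both(*events):
    """Compound event: all of the given events occur together."""
    return lambda o: all(e(o) for e in events)

A = lambda o: o[0] == "H"     # head on the first toss
B = lambda o: o[1] == "H"     # head on the second toss
C = lambda o: o[0] == o[1]    # the two tosses agree

pairs_ok = all(prob(both(x, y)) == prob(x) * prob(y)
               for x, y in [(A, B), (A, C), (B, C)])
triple_ok = prob(both(A, B, C)) == prob(A) * prob(B) * prob(C)

print(pairs_ok, triple_ok)    # True False
```

Here P(ABC) = 1/4 while P(A) · P(B) · P(C) = 1/8, so A, B, and C are not independent events in the sense of Definition 4.4.2.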




In the practical applications of probability theory, independence is of special 
importance when an investigator makes repeated “trials” of a chance situation,
such as repeated tosses of a coin, repeated rolls of a pair of dice, a treatment 
applied to a number of patients, blood pressure measured on a number of men, 
or repeated blood-pressure measurements made on a single individual. Such 
repeated trials are called independent repeated trials if the same probability 
structure applies to all trials and the equalities in Definition 4.4.2 hold for the 
events made up of the outcome on trial 1, the outcome on trial 2, and so on 
through the outcome on trial n. The most commonly used methods of statistical 
inference require that the observed data come from independent repeated 
trials. Hence in any experiment to which such statistical inference is to be 
applied, an important consideration is assuring that the conditions for indepen- 
dent repeated trials are satisfied. Thus the restrictions “fair” coin and “fair” 
toss, "fair" dice and "fair" roll, and the requirements that we shall meet in 
Chapter 5 for having the proper kind of sample for generalizing to a popula- 
tion. 

In independent repeated tosses of a fair coin, for example,


P(5 successive heads) = (1/2)(1/2)(1/2)(1/2)(1/2) = 1/32 = .03125,

and in independent repeated rolls of a pair of fair dice,

P(5 successive 7s) = (1/6)(1/6)(1/6)(1/6)(1/6) = 1/7776 = .0001286.

(Looking at these probability values, the reader is likely to say that if he were
present at the occurrence of five successive heads or five successive 7s in any
short-run experience, he would doubt the validity of the assumption “fair.” At
that moment he will have grasped the logic underlying one major part of
statistical inference!)
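
The two run probabilities follow directly from the multiplication rule for independent repeated trials. A minimal sketch, assuming a fair coin and a fair pair of dice (six of the 36 equally likely outcomes give a 7):

```python
from fractions import Fraction

p_head = Fraction(1, 2)      # fair coin
p_seven = Fraction(6, 36)    # P(7) on one roll of a pair of fair dice

p_five_heads = p_head ** 5       # 1/32
p_five_sevens = p_seven ** 5     # 1/7776

print(float(p_five_heads))               # 0.03125
print(round(float(p_five_sevens), 7))    # 0.0001286
```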


4.5 BERNOULLI TRIALS. THE BINOMIAL DISTRIBUTION


A particularly interesting class of experiments involving n independent 
repeated trials is that in which each trial has just two possible outcomes. The 
coin-tossing experiment is an example. Each item drawn from an industrial 
production line may be classified as either “satisfactory” or “defective.” With 
reference to a disease under investigation, a person drawn from a population 
may be classified as either “has the disease" or “does not have the disease." In 
general, for many investigations concerning a certain given characteristic, the 
point of interest is whether an observation reveals the presence of the charac- 
teristic or its absence. Since there are only two possible outcomes, they are 
always of the form “category A" and “category not-A.” To simplify terminol- 
ogy, one often uses in a general sense the terms "success" (S) and "failure" 
(F)—which frequently apply literally to the games of chance around which much 
of the early development of probability theory took place. Even more conven- 
ient are the labels 1 and 0, which have the added advantage that the sum of 
the observations is the total number of “successes” observed in n trials. The 
probability structure is completely defined by assigning to the two outcomes S 
and F (or 1 and 0) two nonnegative numbers totaling unity; it has become 
customary to designate these probability numbers by p and q, subject to 
p + q = 1. A trial having such a probability structure is called a Bernoulli trial,
named for the great seventeenth-century mathematician Jacob Bernoulli, who 
made extensive investigations in this area. 


When an experiment consists of n independent repeated Bernoulli trials, every 
possible result of the experiment is a sequence of n characters each of which is 
S or F (or 1, 0), such as {SFSSSF} in the case n = 6. We can talk of the n-tuple 
(y1, y2, ..., yn) with each y being 1 or 0. Thus {SFSSSF} is the 6-tuple
(1, 0, 1, 1, 1, 0). A matter of primary importance in such trials is the number of
“successes” in the n trials. In our example with n = 6, the number of successes
is 4.

It is obvious that the possible number of successes in n trials is 0 or 1 or 2 or
3 or ... or n. In n = 6 trials, we can have 0, 1, 2, 3, 4, 5, or 6 successes. Thus
the number of successes is a variable, and we say that it is a random variable 
because the specific value depends on chance. 




We can see how the pattern of probability for this kind of random variable is 
developed by considering the simple case of tossing a coin three times, as in 
Example 4.2.2. If we let 1 = head and 0 = tail, then all possible fundamental
outcomes of the experiment are given by the triples: 


(1, 1, 1), (1, 1, 0), (1, 0, 1), (1, 0, 0), (0, 1, 1), (0, 1, 0), (0, 0, 1), (0, 0, 0). 


Each outcome is the result of three independent repeated Bernoulli trials in 
which P(success) = 1/2. Thus, each fundamental outcome has probability
(1/2)(1/2)(1/2) = 1/8. An event having to do with the number of heads in the 
three tosses is made up of the appropriate fundamental outcomes. For example,
“two heads in three tosses” is the event that occurs if the fundamental
outcome is either (1, 1, 0), (1, 0, 1), or (0, 1, 1). We can say that the event “two
heads in three tosses” is the set of fundamental outcomes composed of the
outcomes (1, 1, 0), (1, 0, 1), and (0, 1, 1). It is customary to use braces to
indicate such sets, and we can write


two heads in three tosses = {(1, 1, 0), (1, 0, 1), (0, 1, 1)}.


The probability of such an event is the total probability that goes with the 
collection of fundamental outcomes. Thus we have 


P(two heads in three tosses) = P({(1, 1, 0), (1, 0, 1), (0, 1, 1)}) = 1/8 + 1/8 + 1/8 = 3/8.


Considering all possible numbers of heads, we set down the various likeli- 
hoods: 


P(no head in three tosses) = P({(0, 0, 0)}) = 1/8,

P(one head in three tosses) = P({(1, 0, 0), (0, 1, 0), (0, 0, 1)}) = 3/8,

P(two heads in three tosses) = P({(1, 1, 0), (1, 0, 1), (0, 1, 1)}) = 3/8,

P(three heads in three tosses) = P({(1, 1, 1)}) = 1/8.




We can summarize the entire pattern by using Y to stand for “the number of
heads in three tosses,” y to represent various specific values of Y, and then
write down a table:


Y = number of heads in 3 tosses of a coin

y     P(Y = y)
0     1/8
1     3/8
2     3/8
3     1/8

As demanded by logic, the sum of the probabilities in the table is 1, since
Y = 0, Y = 1, Y = 2, and Y = 3 exhaust the totality of all possible outcomes for Y.


Rather obvious changes in the argument will take care of an unbalanced coin
for which we know the probability of head. As an example, suppose P(head) =
1/4 in any single toss, and we again take three independent tosses. Then the
pattern develops as follows.

Outcome      Probability
(1, 1, 1)    (1/4)(1/4)(1/4) = 1/64
(1, 1, 0)    (1/4)(1/4)(3/4) = 3/64
(1, 0, 1)    (1/4)(3/4)(1/4) = 3/64
(1, 0, 0)    (1/4)(3/4)(3/4) = 9/64
(0, 1, 1)    (3/4)(1/4)(1/4) = 3/64
(0, 1, 0)    (3/4)(1/4)(3/4) = 9/64
(0, 0, 1)    (3/4)(3/4)(1/4) = 9/64
(0, 0, 0)    (3/4)(3/4)(3/4) = 27/64


y    Event Y = y                          P(Y = y)
0    {(0, 0, 0)}                          (3/4)(3/4)(3/4) = (1/4)⁰(3/4)³ = 27/64
1    {(1, 0, 0), (0, 1, 0), (0, 0, 1)}    9/64 + 9/64 + 9/64 = 3(1/4)¹(3/4)² = 27/64
2    {(1, 1, 0), (1, 0, 1), (0, 1, 1)}    3/64 + 3/64 + 3/64 = 3(1/4)²(3/4)¹ = 9/64
3    {(1, 1, 1)}                          (1/4)(1/4)(1/4) = (1/4)³(3/4)⁰ = 1/64
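
The listing-and-totaling argument used for both coins can be mechanized. The sketch below is only an illustration (the function name `heads_distribution` is an assumption, not the book's): it lists every n-tuple of 1s and 0s, multiplies the single-toss probabilities, and totals them by number of heads.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

def heads_distribution(n, p):
    """Distribution of the number of heads in n independent tosses,
    found by listing every n-tuple of 1s (heads) and 0s (tails)."""
    q = 1 - p
    dist = Counter()
    for outcome in product((1, 0), repeat=n):
        prob = Fraction(1)
        for toss in outcome:
            prob *= p if toss == 1 else q   # independence: multiply
        dist[sum(outcome)] += prob          # total by number of heads
    return dict(sorted(dist.items()))

balanced = heads_distribution(3, Fraction(1, 2))
unbalanced = heads_distribution(3, Fraction(1, 4))

for y in range(4):
    print(y, balanced[y], unbalanced[y])
```

The balanced coin gives 1/8, 3/8, 3/8, 1/8 and the unbalanced coin gives 27/64, 27/64, 9/64, 1/64, matching the hand calculation.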




It is a small step to generalize this to cover the case of any given probability 
of heads: wherever we have had 1/4, use the general value p; wherever 3/4 
appears, use q (which is 1 - p). A bit more effort is needed to generalize from
three tosses to any desired number, say n tosses. There is indeed a general 
formula, and it is the basis of calculating probabilities for this large class of 
situations. For our purposes, however, some representative cases shown by 
tables will satisfy our needs. 
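
For readers who want to see it, the general formula alluded to here is the standard binomial formula, P(Y = y) = C(n, y) p^y q^(n-y), where C(n, y) counts the n-tuples having exactly y ones. The text itself relies on tables, so the following is offered only as a hedged sketch; the function name `binomial_pmf` is illustrative.

```python
from fractions import Fraction
from math import comb

def binomial_pmf(y, n, p):
    """P(Y = y): y successes in n independent trials with success probability p."""
    q = 1 - p
    return comb(n, y) * p**y * q**(n - y)

three_tosses = [binomial_pmf(y, 3, Fraction(1, 4)) for y in range(4)]
print(three_tosses)
```

With n = 3 and p = 1/4 this reproduces the values 27/64, 27/64, 9/64, and 1/64 found above by listing outcomes.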


The important notion about Bernoulli trials is the idea of how that 
mathematical model fits a wide variety of chance situations. It is applicable in 
every case where the center of interest can be interpreted as 


Y =the number of "successes" in n independent trials 
of an experiment wherein the probability 
of success in a single trial is p. 


The random variable identified in this manner is called a binomial random
variable, and the pattern of its probability structure is called the binomial 
distribution. 


For example, the patterns of the probability structure for the cases of tossing 
a coin three times worked out above are shown graphically as follows:


[Figure: two bar graphs of probability against number of heads (0, 1, 2, 3), one for the balanced coin (vertical scale in eighths) and one for the unbalanced coin (vertical scale in sixty-fourths).]




The parameters of the binomial distribution are n and p, a specific numerical 
distribution being produced for each specific set of values assigned to n, p. The 
above case of the balanced coin has n = 3, p = (1/2); the case of the unbalanced
coin is given by n = 3, p = (1/4). Table A-2 in the Appendix gives some
representative cases. Because practical problems involve a bracket of y values
more often than single individual values, published tables usually give their
information in the form of cumulative sums. Table A-2 gives such cumulative
sums for the number of successes up to and including the tabled y value. For
example, the entries for n = 5 and p = .40 read as follows:


y     p = .40

0     .0778
1     .3370
2     .6826
3     .9130
4     .9898
5     1.0000


The heading of Table A-2 includes the statement “For designated values of n 
and p, the tabled entry gives P(Y ≤ y).” Thus the above entries stand for the
following:


P(Y = 0) = .0778
P(Y = 0 or 1) = .3370
P(Y = 0 or 1 or 2) = .6826
P(Y = 0 or 1 or 2 or 3) = .9130
P(Y = 0 or 1 or 2 or 3 or 4) = .9898
P(Y = 0 or 1 or 2 or 3 or 4 or 5) = 1.0000
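
The cumulative entries can be reproduced by summing individual binomial probabilities. The sketch below assumes the standard binomial formula (which the text does not write out) and uses an illustrative function name, `binomial_cdf`.

```python
from math import comb

def binomial_cdf(y, n, p):
    """P(Y <= y): sum of the binomial probabilities for 0, 1, ..., y successes."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(y + 1))

column = [round(binomial_cdf(y, 5, 0.40), 4) for y in range(6)]
for y, value in enumerate(column):
    print(y, f"{value:.4f}")
```

Rounded to four places, the results agree with the column quoted from Table A-2 for n = 5 and p = .40.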


Example 4.5.1 

In a certain very large population it is hypothesized that 30 percent of the 
individuals need dental treatment. If 10 persons are drawn at random from the 
population and examined, what is the probability that the number of persons 
found to need dental treatment will not exceed 3, assuming the hypothesis to be 
correct? Here the large size of the population makes it reasonable that we 
consider the 10 persons to be 10 independent trials of a situation wherein the 
probability of “dental treatment needed” is 0.30 at each trial. Thus if Y = the
number of persons needing dental treatment, Y can be considered a binomial 
random variable with n=10 and p=.30. From Table A-2 we answer the 
stated question as follows. 


P(Y ≤ 3) = .6496.




The probability of finding exactly 3 in need of dental treatment can be
calculated as
P(Y = 3) = P(Y ≤ 3) - P(Y ≤ 2) = .6496 - .3828 = .2668.
The probability of finding at least 6 (6 or more, as many as 6) in need of dental
treatment comes from the table as
P(Y ≥ 6) = P(opposite of Y ≤ 5) = 1 - P(Y ≤ 5) = 1 - .9527 = .0473.
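
The three table manipulations in Example 4.5.1 (a cumulative value, a difference of two cumulative values, and a complement) can be sketched in code. The function `binomial_cdf` is an illustrative stand-in for looking up Table A-2, computed from the standard binomial formula.

```python
from math import comb

def binomial_cdf(y, n, p):
    """P(Y <= y) for a binomial random variable with parameters n and p."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(y + 1))

n, p = 10, 0.30

at_most_3 = binomial_cdf(3, n, p)                          # P(Y <= 3)
exactly_3 = binomial_cdf(3, n, p) - binomial_cdf(2, n, p)  # P(Y = 3)
at_least_6 = 1 - binomial_cdf(5, n, p)                     # P(Y >= 6)

print(round(at_most_3, 4), round(exactly_3, 4), round(at_least_6, 4))
# 0.6496 0.2668 0.0473
```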

You may have noticed that the entries in Table A-2 do not include any 
values of p greater than .50. This is generally the case with published tables of 
binomial distributions. It is understood that any binomial situation involving a 
value of p greater than 1/2 can be rephrased in terms of counting the number
of “failures,” for which the probability at each trial is q = 1 - p, giving a
binomial distribution with probability parameter less than 1/2. 


Example 4.5.2 


Suppose a machine has a probability 0.9 of operating successfully during a 
day's shift, and its day-to-day operations are independent. What is the proba- 
bility that it will operate successfully at least 13 days out of the next 15? We 
can argue as follows. 


Y = the number of successful days;

Y is binomial, n = 15, p = 0.9.

U = the number of unsuccessful days;

U is binomial, n = 15, p = 0.1.

P(Y ≥ 13) = P(U ≤ 2) = 0.8159.


The probability that the machine would operate successfully no more than 10
days can be calculated as follows.


P(Y ≤ 10) = P(U ≥ 5) = P(opposite of U ≤ 4)
= 1 - P(U ≤ 4) = 1 - .9873 = .0127.
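
That counting failures gives the same answer as counting successes can be confirmed numerically. A minimal sketch, with an illustrative helper `binomial_pmf` based on the standard binomial formula:

```python
from math import comb, isclose

def binomial_pmf(k, n, p):
    """P(exactly k successes in n independent trials with success probability p)."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

n = 15

# At least 13 successful days, computed directly with p = 0.9 ...
direct = sum(binomial_pmf(y, n, 0.9) for y in range(13, n + 1))
# ... and as at most 2 unsuccessful days with p = 0.1.
via_failures = sum(binomial_pmf(u, n, 0.1) for u in range(0, 3))

# No more than 10 successful days = at least 5 unsuccessful days.
at_most_10 = 1 - sum(binomial_pmf(u, n, 0.1) for u in range(0, 5))

print(isclose(direct, via_failures))              # True
print(round(via_failures, 4), round(at_most_10, 4))   # 0.8159 0.0127
```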


EXERCISES


4.5.1 What is the probability of drawing five spades from a deck of bridge cards, if: 
(a) each draw is replaced and the deck reshuffled before the next draw, (b) 
each draw is kept out of the deck? 


4.5.2 What are the probabilities of drawing five cards of the same suit under 
conditions (a) and (b) of Exercise 4.5.1? 


4.5.3 In Section 4.4 we consider the Mecca College data distributed jointly as to sex 
and political party preference. We found P(MD) ≠ P(M) · P(D), and thus
concluded that sex and political party preference are not in general indepen- 
dent. Do any of the pairs of categories show independence? (That is, check out 
the other seven combinations MR, MO, MN, FD, FR, FO, FN.) 


4.5.4 In the Mecca College data, are no opinion on legalizing marijuana and no
political-party preference independent?


4.5.5 A seed company claims that 70 percent of a certain kind of seed germinate.
You plant 20 of the seeds and find that 12 sprout. What is the probability that 
no more than this number will sprout if the company's claim is correct? 


4.5.6 Suppose the probability is 1/5 that a certain type of missile will arrive and
function properly in an assigned target zone. If it does arrive and function
properly in the target zone, the target is destroyed.
a. If five missiles are launched against one target, what is the probability that
exactly one missile will arrive and function properly in the target zone?
b. If five missiles are launched against one target, what is the probability that
the target will be destroyed?
c. If 20 missiles are launched against one target, what is the probability that at
least 1/5 of them (i.e., at least four) will arrive and function properly in the
target zone?


4.5.7 Last year a certain school-bond referendum was defeated, the proportion of
“yes” votes being 40 percent. In a recent attitude survey, 15 voters were 
questioned. Three said they are in favor of a school-bond issue now, and the 
other 12 registered disapproval. What is the probability of a response as small 
as this if the general attitude is the same as last year and the 15 voters 
questioned are a random sample of the corresponding population? If the 
probability is small, what questions does it raise about interpreting the result of 
the survey? 


4.5.8 In a certain multiple-choice test, each question offers four options for answer;
one and only one of the four options is the correct answer. The test has 20 
questions. If a person taking the test chooses each answer by pure guessing, 
what is the probability that he will pass the test (a) if the lowest passing 
performance is 10 correct answers, (b) if the test is scored by subtracting the 
number of wrong answers from the number of correct answers, and then taking 
10 as the lowest passing score? 


4.5.9 The tennis courts in Park A are playable 80 percent of the days in a year. If 10
contest days are chosen, what is the probability that games can be played on
the courts at least 8 of the days, assuming the binomial distribution to be
applicable? What can you see as an argument against the applicability of the
binomial distribution?


4.5.10 Suppose you complain about your experience with the seeds in Exercise 4.5.5,
and are told that the probability you computed is large enough to allow the
argument that chance has been at work while 70 percent is indeed the
germination rate. You ask how small the probability has to be for ruling out
that argument, and the reply is “well, under 5 percent.” What then is the
number of germinating seeds (out of 20) that will send you back to the
complaint department?


4.6 PATTERNS OF CHANCE 


In the preceding section we considered a probability structure covering a 
wide variety of chance situations by treating the essential feature as a random 
variable having the binomial distribution. In general, the use of probability 
theory goes this way: translate the chance situation into a probability setup
having a definable random variable, and then find and apply the probability 
distribution of that random variable. 


Sometimes the case is a very special one. Rolling dice gives an example. 
Here the interest generally centers on the "point" that is rolled—that is, the 
sum of the numbers of dots showing on the top faces when the dice come to 
rest. We have seen earlier how to calculate probabilities for the different 
"points." The entire structure can be computed and set down in a distribution 
table, where Y = the point rolled in a fair roll of a pair of fair dice. 


y          2     3     4     5     6     7     8     9     10    11    12
P(Y = y)   1/36  2/36  3/36  4/36  5/36  6/36  5/36  4/36  3/36  2/36  1/36
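
The distribution of the point can be computed by counting the 36 equally likely outcomes for a pair of fair dice. A short sketch, with illustrative names:

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Count how many of the 36 (first die, second die) outcomes give each point.
counts = Counter(a + b for a, b in product(range(1, 7), repeat=2))

dice_dist = {y: Fraction(c, 36) for y, c in sorted(counts.items())}

for y, c in sorted(counts.items()):
    print(y, f"{c}/36")
```

The counts run 1, 2, 3, 4, 5, 6, 5, 4, 3, 2, 1 for the points 2 through 12, and the probabilities total 1.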


broad classes of situations. Thus A = 


shall introduce a few others that have wide application. 




We have seen how the probability distribution of a random variable can be 
given by a table of probability values. Sometimes a random variable can have 
its probability values given by a formula—you plug in the value of y and the 
formula gives you an answer which is P(Y = y). Cases of these kinds can be
shown graphically by a histogram. In a histogram, a horizontal scale (axis)
shows the possible values y, and vertical bars give the probabilities P(Y = y) by
height. Each bar is one unit wide, and is centered on the y-value to which it 
refers. The dice-roll random variable and the binomial random variable of 
Example 4.5.1 could have their distributions shown as follows. 


[Figure: histograms with vertical axis P(Y = y).]

Probability distribution of number Y who need dental treatment
when Y is binomial with n = 10, p = 0.3.




Note that in such histograms we have a correspondence between area and 
probability. Each bar has a width of 1 and a height of P(Y = y). Hence the bar
has area 1 × P(Y = y) = P(Y = y). The total area in the histogram is then the
sum of all the probabilities, namely 1. 


This is a useful correspondence when we move on to random variables that 
have too many possible values to allow a histogram representation of their 
probability distributions. Take, for example, a case of Y = the weight (in
pounds) of a child at birth. The possible values of weight are infinite in number, 
of an especially potent kind of infinity. For between any two values, say 9.6 
and 9.7, there is at least one more possible value taking account of finer 
measurement, and between 9.64 and 9.65 there is another, and between 9.643 
and 9.644 there is another, and so on and on. We say that weight is a 
continuous variable. 


It is not hard for our imagination to generalize from histograms to diagrams 
having so many vertical bars of such skinny width that the graph is an area 
bounded at the top by a smooth curve. In careful mathematical detail the end 
result is exactly that, so that we can make use of the area idea in very general 
cases. If the curve shown makes sense in some situation, and the total area 
under the curve is 1, then the shaded area is precisely the probability that the 
random variable Y will take on some value between a and b. 


[Figure: probability density curve f(y), with the area under the curve between a and b shaded.]


In a situation like this, we cannot label the vertical axis as probability since 
now probability is given only by areas, not by heights. We call the curve the 
probability density curve of Y and label the vertical axis by some notation like 
the one shown—f(y), read “eff of y", standing for the mathematical function of 
y which the curve graphs. Such a function is called a probability density function 
and either its equation or its graph gives the story; its use is given by taking 
areas. In most practical situations, published tables allow us to get our desired 
areas without drawing pictures and measuring. 


4.7 THE STANDARD NORMAL DISTRIBUTION


The probability distribution with the widest use among continuous random 
variables is a special one given the name standard normal. It is sometimes 
referred to also as the Gaussian distribution, named for the German 
mathematician Karl Friedrich Gauss (1777-1855), who had much to do with its 
derivation and early use in the study of measurement errors. 


The graph of the standard normal probability density function is the 
so-called “bell-shaped curve": 


[Figure: bell-shaped curve φ(z) plotted over z from -3 to 3.]
The standard normal distribution.


This distribution arose in the so-called "theory of errors" by study of the 
random variations (“errors”) that occur in making measurements. If we measure
the height of a basketball player, the weight of a sleeping bag, the speed of
a bullet, the distance between two planets, the I.Q. of a student, repeated
measurements will vary around the “true” value. If the “overs” and “unders”
are expressed in terms of multiples of their standard deviation, the probability 
pattern in the most common cases is like the above graph. 


The distribution came to light also in the early searches made to find 
approximations to the probabilities in the binomial distribution. Even the brief 
discussion about the binomial distribution in Section 4.5 must have suggested 
to the reader that calculating the probabilities must be a real chore whenever n 
is large. That is indeed a fact, and so it was a major discovery (De Moivre, 
1733) that the binomial distribution has a closer and closer relationship to the 
above graph as n gets larger and larger. We shall see how this works in Section 




In the years since its formulation, the standard normal distribution has been 
found useful for describing the probability patterns of a wide variety of random 
variables of the continuous type. It has been the hardest worked, and some- 
times most badly overworked, of all known probability distributions. The 
nomenclature normal has nothing to do with being the opposite of “abnormal.”
The term arose apparently through the notion that when a random
variable is expressed as a directed distance of so many standard deviations 
away from the mean, it has been "standardized" or “normalized.” 


This distribution appears in so many various applications that we set aside 
notation for its own use: Z for the standard normal random variable, z for the 
numerical variable indicating the values that Z can assume, and φ(z) for the
mathematical function which defines the curve. That function is specifically 


φ(z) = (1/√(2π)) exp(-z²/2),     -∞ < z < ∞,


but this need not concern the reader so far as finding probabilities is concerned, 
since area is the name of our game, and areas are given by readily available 
tables. 
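
For the curious, the density function can be evaluated directly, and the claim that the total area under the curve is 1 can be checked by adding up thin rectangles. This is a rough numerical sketch, not a method used in the text; the name `phi`, the step width, and the cutoffs at -8 and 8 are illustrative choices.

```python
from math import exp, pi, sqrt

def phi(z):
    """Standard normal density: (1 / sqrt(2*pi)) * exp(-z**2 / 2)."""
    return exp(-z * z / 2) / sqrt(2 * pi)

# Approximate the area under the curve from z = -8 to z = 8
# by summing rectangles of width dz (the area beyond is negligible).
dz = 0.001
area = sum(phi(-8 + i * dz) * dz for i in range(16000))

print(round(phi(0.0), 4))   # 0.3989, the height of the curve at its peak
print(round(area, 4))       # 1.0
```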


Note that the standard normal curve is defined to go along off to the left as 
far as “minus infinity" and off to the right as far as “plus infinity." Obviously 
no one will ever get to see either end, and a logical purist can raise the 
objection that the standard normal distribution cannot match anything in real 
life since nothing measurable involves the infinitely negative or the infinitely 
positive. But you can see from the graph that the area under the curve beyond 
z = -3 on the left or beyond z = +3 on the right is very small. In the next
paragraph we shall see how small. Thus as in so much of mathematics, the 
standard normal probability distribution can approximate reality since the 
probability shown for unrealistic values of z is so small as to be “zero for all 
practical purposes.” 


Table A-3 is an abridged table of the standard normal distribution. It is 
arranged in such a way as to be most convenient for general use. Tables giving
more detailed entries are widely available in textbooks, mathematics handbooks,
and separate volumes.


[Figure: standard normal curve φ(z); the shaded area to the left of z* is P(Z < z*).]

[Figure: area .1587 to the left of z = -1.0.]
P(Z < -1.0) = .1587

[Figure: area .7258 to the left of z = 0.6.]
P(Z < 0.6) = .7258


When we refer to Table A-3 with a value of z, the corresponding reading is
the area under the curve to the left of that z value. The column is labeled
P(Z < z) since that is what the indicated area represents. By obvious maneuvers
we can apply such pieces of information to produce the probability of any
desired interval of z values. For example, the last two diagrams give
immediately


P(Z > -1.0) = 1 - P(Z < -1.0) = 1 - .1587 = .8413,
P(Z > 0.6) = 1 - P(Z < 0.6) = 1 - .7258 = .2742.




Abbreviated sketches make short work of figuring out probabilities for z 
intervals. The following are examples. 


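[Figure: area .6915 to the left of z = 0.5 and area .1151 to the left of z = -1.2.]
P(-1.2 < Z < 0.5) = .6915 - .1151 = .5764

[Figure: area .9987 to the left of z = 3.0 and area .0013 to the left of z = -3.0.]
P(Z < -3.0) = .0013,
P(Z > +3.0) = 1 - .9987 = .0013,
P(-3.0 < Z < +3.0) = .9987 - .0013 = .9974.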


Note that the probability for a z interval is the same whether or not we include 
either or both endpoints, since area is unchanged whether or not a boundary 
line is included, there being no area on a line. 
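
Where a table is not at hand, the same left-hand areas can be obtained from Python's standard library, which provides `statistics.NormalDist` with a `cdf` method. The sketch below treats `cdf` as a stand-in for a table lookup; the helper names are illustrative. Because table entries are rounded before subtracting, a table-based answer can differ from the computed one by a unit in the fourth decimal place.

```python
from statistics import NormalDist

Z = NormalDist()   # mean 0, standard deviation 1: the standard normal

def area_left(z):
    """P(Z < z): the area under the curve to the left of z."""
    return Z.cdf(z)

def between(a, b):
    """P(a < Z < b), found by subtracting two left-hand areas."""
    return area_left(b) - area_left(a)

print(round(area_left(-1.0), 4))       # 0.1587
print(round(1 - area_left(-1.0), 4))   # 0.8413
print(round(between(-1.2, 0.5), 4))    # 0.5764
print(round(area_left(-3.0), 4))       # 0.0013
```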


By using Table A-3 in reverse fashion, we can answer questions about z 
values which bound any desired likelihood region of Z. The following are 
examples. 


[Figure: standard normal curve with given area .10 in the left tail, bounded at z = -1.282.]


What is the limit of the 10 percent most negative z values? We graph the
situation at hand, identify the area which is of the kind given by Table A-3,
and read off the z value. The 10 percent most negative z values are z < -1.282.


[Figure: standard normal curve with given area .05 in the right tail and area .95 to its left, bounded at z = 1.645.]


Beyond what z value is the likelihood of Z no more than 5 percent? The
questioned z value has 5 percent probability to its right, and hence 95 percent
probability to its left. Entering Table A-3 with P(Z < z) = .9500, we read
z = 1.645. The diagram shows that any z value from 1.645 on out to the right
has 5 percent or less probability beyond it.


What are the boundaries of the central z interval having 95 percent 
probability? 


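[Figure: standard normal curve with area .9750 to the left of z = 1.960 and area .0250 to the left of z = -1.960, leaving .025 in each tail.]

P(-1.960 < Z < 1.960) = .95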
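
Reading the table in reverse corresponds to the `inv_cdf` method of `statistics.NormalDist` in Python's standard library: given an area to the left, it returns the z value. A minimal sketch of the three examples above, with illustrative variable names:

```python
from statistics import NormalDist

Z = NormalDist()   # the standard normal distribution

lower_10 = Z.inv_cdf(0.10)      # z with 10 percent of the area to its left
upper_5 = Z.inv_cdf(0.95)       # z with 5 percent of the area to its right
central_95 = Z.inv_cdf(0.975)   # upper boundary of the central 95 percent

print(round(lower_10, 3), round(upper_5, 3), round(central_95, 3))
# -1.282 1.645 1.96
```

For the central interval, 2.5 percent is left in each tail, which is why the lookup uses .975 rather than .95.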


EXERCISES 

4.7.1 Find the probability of each of the following statements of behavior of the
standard normal random variable Z: (a) Z < -0.8, (b) Z < 2.0, (c) Z > 1.0, (d)
Z > -0.2, (e) Z < 1.2, (f) Z < -1.2, (g) Z > 1.2, and (h) Z ≥ -1.2.

4.7.2 Find the following probabilities concerning the standard normal random variable
Z: (a) P(-0.6 < Z < 1.6), (b) P(1.6 < Z < 2.6), (c) P(-1.9 < Z < -0.9), (d)
P(-1.5 < Z ≤ +1.5), (e) P(Z < -1.5 or Z ≥ +1.5), and (f) P(Z < -1.5 or Z ≥
+2.0).

4.7.3 Identify the value of z in each of the following statements about the standard
normal distribution: (a) P(Z ≤ z) = .10, (b) P(Z < z) = .70, (c) P(Z > z) = .70, (d)
P(Z > z) = .10, (e) P(Z < z) = .50, (f) P(Z ≥ z) = .50, (g) P(Z < z) = .25, (h)
P(Z > z) = .80, and (i) P(-z < Z < z) = .90.




4.7.4 Identify the values of z satisfying the following conditions: (a) the upper limit of 
the values of Z in the lowest 15 percent of the distribution, (b) the lower z limit 
of the upper 25 percent of the distribution, (c) the z value that has 60 percent 
probability to its left on the z axis, (d) the z value that has 92 percent 
probability to its right on the z axis, (e) the z value that is exceeded with 
probability .01, and (f) the z value that is exceeded 95 percent of the time. 


4.7.5 What are the boundaries of the central interval in the standard normal distribu- 
tion having: (a) 80 percent probability, (b) 99 percent probability? 


4.7.6 What are the cutting points for dividing the z axis into: (a) four parts having 25 
percent probability each, (b) five parts having equal probability? 


4.8 DESCRIPTIVE MEASURES IN PROBABILITY DISTRIBUTIONS 


In Chapters 2 and 3 we considered various ways of showing the pattern 
exhibited by a set of data, and the most common summarizing statistics to 
indicate central tendency and dispersion. For central tendency we dealt with 
the mean ȳ or the median or the mode of the data, and for dispersion we made
use of the range or the variance s² or the standard deviation s.


Similar considerations are useful with regard to the probability distribution 
of a random variable. The pattern has been discussed in preceding sections. For 
a discrete random variable we have all the probabilities given by a table or a
formula, and the pattern can be graphed by a histogram. For a continuous 
random variable the pattern is given by a probability density function, which is 
graphed by a continuous curve and which associates probabilities with areas 
under that curve. 


Descriptive measures like mean, median, and standard deviation play an 
important role in the study of a probability distribution. As in the case of a 
collection of data, these measures summarize the central tendency and disper- 
sion of the random variable. In most situations of practical interest they enable 
us to identify the arbitrary constants (parameters) in a distribution of known 
form and thus to identify the probability pattern completely. 


Recall the definition of sample mean: ȳ = Σy/n. This can be written as


ȳ = (1/n)y1 + (1/n)y2 + ... + (1/n)yn.

In this form we see ȳ as a weighted average of the ys: each y-value is given the
weight (1/n), we multiply (“weight”) each y-value by that amount (1/n), and
add the results. This total is then divided by the sum of the weights, but here
the sum of the weights is just 1:


1/n + 1/n + ... + 1/n = n(1/n) = 1.



The mean of a random variable Y (we call it also the mean of the probability 
distribution of Y) is defined in a similar manner. But now the y values at hand 
are all the possible values that Y can assume, and the weights are the 
probabilities associated with those various values. Again the sum of the weights 
is 1, since the total probability mass (“weight”) in any chance situation is 1.


For any kind of random variable, such a weighted average is called its 
expectation or expected value, as well as being called its mean. The process of 
taking expectation is indicated by the notation $E(\ )$: $E(Y)$, $E(Z)$, $E(Y^2)$, and so
on. The shorthand notation is $\mu$ (the lower-case Greek letter mu), subscripted
to show the related random variable if there is any danger of confusion as to 
what random variable is under discussion at the time. 


If Y is a discrete random variable, its probability distribution is given by
the probability function $f(y) = P(Y = y)$, and the mean of Y can be readily
defined by complete formula:

Mean, mean value, expected value, or expectation of Y:

$$\mu = E(Y) = \sum y \cdot f(y). \qquad (4.8.1)$$


Example 4.8.1. Y = “point” rolled with a pair of dice.


The probability function for this Y has been given in Section 4.6. We now 
can calculate the mean of Y as follows. 


  y                  f(y) = P(Y = y)    y · f(y)
  2                  1/36               2/36
  3                  2/36               6/36
  4                  3/36               12/36
  5                  4/36               20/36
  6                  5/36               30/36
  7                  6/36               42/36
  8                  5/36               40/36
  9                  4/36               36/36
  10                 3/36               30/36
  11                 2/36               22/36
  12                 1/36               12/36
  Any other value    0                  0

  Sum                                   252/36

$$\mu = E(Y) = 252/36 = 7$$
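The weighted average in (4.8.1) is easy to check by machine. The following is a minimal sketch in Python, assuming two fair dice; the names `f` and `mu` are our own illustrative choices. Exact fractions are used so that no rounding enters the sum.

```python
from fractions import Fraction

# Probability function of the point Y rolled with a pair of fair dice:
# each of the 36 equally likely (first, second) outcomes adds 1/36
# to the probability of its total.
f = {}
for first in range(1, 7):
    for second in range(1, 7):
        total = first + second
        f[total] = f.get(total, Fraction(0)) + Fraction(1, 36)

# Mean as a probability-weighted average of the possible values, as in (4.8.1).
mu = sum(y * prob for y, prob in f.items())
print(mu)  # 7
```

The dictionary `f` reproduces the middle column of the table, and the sum reproduces 252/36 = 7.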




This is shown on the graph of the probability distribution in the diagram
below, which shows that the center of gravity of this distribution is
$E(Y) = \mu = 7$.


[Figure: Probability distribution of point Y rolled with pair of dice. Histogram over y = 2, ..., 12, with heights rising from 1/36 to 6/36 at y = 7 and falling back to 1/36; the mean $\mu = E(Y) = 7$ is marked on the y axis.]


When Y is a continuous random variable, its probability distribution cannot 
be given by a probability function of the kind that takes care of a discrete 
random variable. We have instead a probability density function, and prob- 
abilities are given by areas under the graphed curve. In such a case we cannot 
find a weighted average of the y values by a simple summing process like the 
one above. Methods of the calculus are required, performing a particularly 
tricky kind of summation yielding a result having the same concept as above—a 
probability-weighted average of all possible y values. We shall set down the 
formula for this, but only for the purpose of completing our catalogue. The 
reader will never in this book be asked to calculate this formula—unless he is 
unwilling to take our word for the answer which we shall give wherever 
needed! 


If Y is a continuous random variable, having probability density function 
f(y), then the mean $\mu$ of Y is

$$\mu = E(Y) = \int y \cdot f(y)\, dy. \qquad (4.8.2)$$

It is a point of interest in the history of mathematics that the symbol $\int$ above
(called the integral sign) was devised by stretching out the capital letter S,
standing for sum.
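The summation behind (4.8.2) can be imitated numerically by cutting the y axis into many narrow strips and adding y · f(y) · (strip width) over all the strips. The sketch below does this for a hypothetical density f(y) = 2y on 0 ≤ y ≤ 1, chosen only because its mean, 2/3, is easy to verify; the density, the function names, and the number of strips are all our assumptions.

```python
def density(y):
    """Hypothetical density f(y) = 2y on [0, 1], zero elsewhere."""
    return 2.0 * y if 0.0 <= y <= 1.0 else 0.0


def mean_by_strips(f, low, high, strips=100_000):
    """Approximate the integral of y * f(y) by a sum over narrow strips."""
    width = (high - low) / strips
    total = 0.0
    for i in range(strips):
        y_mid = low + (i + 0.5) * width    # midpoint of the i-th strip
        total += y_mid * f(y_mid) * width  # value times probability in the strip
    return total


approx_mean = mean_by_strips(density, 0.0, 1.0)
print(round(approx_mean, 4))  # 0.6667
```

Each term of the sum is a y value weighted by the small probability f(y) · width lying near it, which is the same idea as the discrete formula (4.8.1).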




In Chapter 3 we took for our descriptive measure of dispersion in a 
collection of data the sample standard deviation s, found by taking the positive 
square root of the sample variance $s^2$, using one or another computing form of
the formula

$$s^2 = \frac{\sum (y - \bar{y})^2}{n - 1}.$$

For the probability distribution of a random variable Y we use the same 
concept of averaging squared deviations of y values from their mean. Here the 
mean is the mean $\mu$ of Y, instead of the mean $\bar{y}$ of the sample data, and the
averaging is according to probability “weights.” We use the notation $\sigma^2$ for
variance and $\sigma$ for standard deviation. Sometimes V(Y) or var (Y) is used to
denote variance of Y. 


If Y is a random variable having probability function f(y) or probability 
density function f(y), the variance of Y (called also the variance of the 
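distribution of Y) is denoted by $\sigma^2$ or V(Y) or var (Y), and is defined as

$$\sigma^2 = V(Y) = E\left[(Y - \mu)^2\right] =
\begin{cases}
\sum (y - \mu)^2 f(y) & \text{if Y is discrete,} \\
\int (y - \mu)^2 f(y)\, dy & \text{if Y is continuous.}
\end{cases} \qquad (4.8.3)$$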


Again the reader should look at the formula for the continuous case as only 
a specialized kind of summation which he will not be asked to calculate. 

The standard deviation of Y (called also the standard deviation of the 
distribution of Y) is $\sigma$, the positive square root of the variance $\sigma^2$:

$$\sigma = \sqrt{\sigma^2}. \qquad (4.8.4)$$


Notice that our notation is consistent with the following useful distinction
between population and sample parameters:

                        Population     Sample
  Mean                  $\mu$          $\bar{y}$
  Variance              $\sigma^2$     $s^2$
  Standard deviation    $\sigma$       s




Example 4.8.2. Y= “point” rolled with a pair of dice. 


In Example 4.8.1 we found $\mu$ to be 7 for this Y. The variance and standard
deviation of Y then come out as follows. 


  y     f(y)     $y - \mu$    $(y - \mu)^2$    $(y - \mu)^2 \cdot f(y)$
  2     1/36     -5           25               25/36
  3     2/36     -4           16               32/36
  4     3/36     -3           9                27/36
  5     4/36     -2           4                16/36
  6     5/36     -1           1                5/36
  7     6/36     0            0                0
  8     5/36     1            1                5/36
  9     4/36     2            4                16/36
  10    3/36     3            9                27/36
  11    2/36     4            16               32/36
  12    1/36     5            25               25/36

$$\sigma^2 = 210/36 = 35/6 = 5.83$$

variance of Y = $\sigma^2$ = 5.83; and
standard deviation of Y = $\sigma = \sqrt{5.83} = 2.41$.
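The same table can be produced by a short computation. This is a sketch only, again assuming fair dice; the shortcut `6 - abs(y - 7)` for the number of ways to roll each point is our own device for building the probability function compactly.

```python
import math
from fractions import Fraction

# Probability function of the point Y: there are 6 - |y - 7| ways
# out of 36 to roll the total y, for y = 2, ..., 12.
f = {y: Fraction(6 - abs(y - 7), 36) for y in range(2, 13)}

mu = sum(y * prob for y, prob in f.items())               # mean, (4.8.1)
var = sum((y - mu) ** 2 * prob for y, prob in f.items())  # variance, (4.8.3)
sigma = math.sqrt(var)                                    # standard deviation, (4.8.4)

print(var)  # 35/6
```

Here `sigma` comes out a little above 2.41, in agreement with the hand calculation.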


Example 4.8.3. The binomial distribution. 


In the general case of this distribution, where Y = the number of “successes”
in n independent trials of an experiment wherein the probability of success in a
single trial is p, the appropriate mathematics (with which we don't want to
bother you) will show:

Binomial distribution:

$$\text{Mean: } \mu = np, \qquad \text{Standard deviation: } \sigma = \sqrt{np(1 - p)}. \qquad (4.8.5)$$


We can see an example of the consistency of this with the definitions (4.8.1) 
and (4.8.3) by using both procedures on the easy case of Y = number of heads 
in three tosses of a fair coin. Early in Section 4.5 this random variable was 




considered, and its probability distribution set down in tabular form. Let us 
take that now and apply (4.8.1) and (4.8.3): 


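  y    f(y)    y · f(y)    $y - \mu$    $(y - \mu)^2$    $(y - \mu)^2 \cdot f(y)$
  0    1/8     0           -3/2         9/4              9/32
  1    3/8     3/8         -1/2         1/4              3/32
  2    3/8     6/8         1/2          1/4              3/32
  3    1/8     3/8         3/2          9/4              9/32

$$\mu = 12/8 = 3/2, \qquad \sigma^2 = 24/32 = 3/4.$$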


By use of (4.8.5) we have the identical results:

$$\mu = np = 3(1/2) = 3/2,$$
$$\sigma^2 = np(1 - p) = 3(1/2)(1/2) = 3/4.$$

In the situation of Example 4.5.1, where we drew a sample of 10 persons
from a population in which 30 percent need dental treatment, Y is binomial
with n = 10 and p = 0.30, so that

$$\mu = \text{mean of } Y = np = (10)(.30) = 3.0,$$
$$\sigma = \text{standard deviation of } Y = \sqrt{np(1 - p)} = \sqrt{10(.3)(.7)} = \sqrt{2.1} = 1.4.$$


The value np readily matches our intuition on what the “expected” number 
of successes should be. When we make 10 trials, each with success probability 
30 percent, we surely "expect" 30 percent of the 10 trials to be successes on 
the average. This on-the-average expected number is what the mean tells us. 

The standard deviation is not so readily comprehended intuitively, but we do 
know by common sense that the actual observed number of successes will vary 
around the mean. The standard deviation is a measure of that variability. 
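The agreement between the definitions (4.8.1) and (4.8.3) and the shortcut formulas (4.8.5) can also be checked for the dental-treatment case, n = 10 and p = .30. The sketch below assumes the usual binomial probability function and Python 3.8 or later (for `math.comb`); the helper name `binomial_pf` is hypothetical.

```python
import math


def binomial_pf(y, n, p):
    """Binomial probability P(Y = y) for n trials with success probability p."""
    return math.comb(n, y) * p**y * (1 - p) ** (n - y)


n, p = 10, 0.30
f = {y: binomial_pf(y, n, p) for y in range(n + 1)}

# Mean and variance from the definitions: probability-weighted averages.
mu = sum(y * prob for y, prob in f.items())
var = sum((y - mu) ** 2 * prob for y, prob in f.items())

# The same quantities from the shortcut formulas (4.8.5).
mu_formula = n * p
var_formula = n * p * (1 - p)

print(round(mu, 6), round(var, 6))  # 3.0 2.1
```

Up to floating-point rounding, the weighted averages agree with np = 3.0 and np(1 - p) = 2.1.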


Example 4.8.4. Binomial with n = 150, p = .30.

Instead of taking a sample of 10 persons from the population of Example
4.5.1, take at random 150 persons. Then the number Y of sampled persons
needing dental treatment is binomial with n = 150 and p = .30. Then for this Y
we have

$$\mu = np = 150(.30) = 45.0,$$
$$\sigma = \sqrt{np(1 - p)} = \sqrt{150(.3)(.7)} = \sqrt{31.5} = 5.6.$$




Example 4.8.5. The machine of Example 4.5.2. 


The machine has a probability 0.9 of operating successfully during a day's 
shift, and its day-to-day operations are independent. Considering Y as the 
number of successful days out of 15, we have: 


Y is binomial, n = 15, p = 0.9;

$$\mu = np = 15(.9) = 13.5,$$
$$\sigma = \sqrt{np(1 - p)} = \sqrt{15(.9)(.1)} = \sqrt{1.35} = 1.2.$$

If we ask about successful days in the next 90-day period, we have:
Y is binomial, n = 90, p = 0.9;

$$\mu = np = 90(.9) = 81.0,$$
$$\sigma = \sqrt{np(1 - p)} = \sqrt{90(.9)(.1)} = \sqrt{8.10} = 2.8.$$


In this book we shall not be interested in extensive study of finding the mean 
and standard deviation of various random variables. That is part of a careful 
study of probability theory. Here we need just to get straight the concept of the 
mean and standard deviation of a random variable Y as distinct from the mean 
and standard deviation of a collection of data relating to Y. 


When Y is a random variable, it is associated with a chance process. That 
process has a mean $\mu$ and a standard deviation $\sigma$ in accordance with the
probability averaging discussed above. Each time we operate the process we 
get a specific numerical result—a y value given to us by Nature using the 
chance mechanism that is associated with the probability distribution of Y. We 
call such a y value an observation on Y, or an observation on the population of 
Y. A collection of such observations is called a sample from the population of Y. 
The number of observations in a sample is called the size of the sample. 


For most cases of practical interest in statistical analysis, such a sample, no 
matter how large its size, cannot tell us the complete story about Y—there are 
always more observations which could be made. Thus the mean $\bar{y}$ of the sample
can never give us precisely $\mu$, the mean of the population of Y, nor can the
sample standard deviation s give us the population standard deviation $\sigma$. What
we shall do with statistical inference is use $\bar{y}$ and s to make educated guesses
about $\mu$ and $\sigma$.


An essential point to keep in mind about a chance process is that the most 
we can know about it is the probability distribution involved. We can never 
know what a dice roll will show; all we can know is the likelihood of the 
various possible results. If Y is a binomial random variable, the most that can 
be known is the pattern of likelihood given by the probability function for the 
binomial distribution, and to completely know that requires knowledge of the 
specific values for n and p. Similarly for any other kind of random variable, the 




totality of knowledge concerning it is given by: (a) the form of its probability 
function or probability density function and (b) the specific numerical values of 
the arbitrary constants (parameters) in that function. 


In the important case of the general normal distribution, which is the 
generalization of the standard normal distribution studied in the preceding 
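section, the mean $\mu$ and standard deviation $\sigma$ completely specify the distribution
in accordance with the following diagram.

[Figure: Normal distribution with mean $\mu$ and standard deviation $\sigma$. Bell-shaped curve over a y axis marked at $\mu - 3\sigma$, $\mu - 2\sigma$, $\mu - \sigma$, $\mu$, $\mu + \sigma$, $\mu + 2\sigma$, $\mu + 3\sigma$.]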


The probability structure is that of the standard normal curve centered at $\mu$
instead of at zero, and using $\sigma$ as the unit of measurement on the horizontal
scale. Such a random variable Y is designated normal, with mean $\mu$ and
standard deviation $\sigma$.

The shorthand notation $N(\mu, \sigma)$ is often used, so that we can write

Y is $N(\mu, \sigma)$

to mean “Y has the normal distribution with mean $\mu$ and standard deviation $\sigma$.”


For example, the probability distribution of scholastic aptitude test (SAT)
scores in a certain population is N(500, 50) and is shown in the following
diagram. Here we see that $\mu = 500$, and $550 = \mu + \sigma$, and so on. In other
words, 550 is just one standard deviation unit above the mean. Similarly 400 is
two standard deviation units below the mean.

[Figure: N(500, 50) density curve of SAT scores over a y axis marked at 350, 400, 450, 500, 550, 600, 650.]




It would be very useful to convert all the numbers on the y scale of 
measurement to the z scale of measurement that we used to discuss the 
standard normal distribution in Section 4.7. We could use a single table for all 
normal populations. 


The conversion formula gives “the standardized normal random
variable Z”:

$$Z = \frac{Y - \mu}{\sigma}.$$


Thus the overall situation is as shown in the following diagram. 


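[Figure: the general y scale, marked at $\mu - 3\sigma$, $\mu - 2\sigma$, $\mu - \sigma$, $\mu$, $\mu + \sigma$, $\mu + 2\sigma$, $\mu + 3\sigma$, aligned with the standard z scale, marked at -3, -2, -1, 0, 1, 2, 3.]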


We see that 400 is 2 standard deviation units below $\mu$ (z = -2); 550 is 1 unit
above $\mu$ (z = +1), and so on. Thus, knowing the mean ($\mu$) and standard deviation ($\sigma$) of
a normal random variable Y, we can translate back and forth between Y and
the standard normal random variable Z by the relationship

$$\frac{Y - \mu}{\sigma} = Z, \quad \text{where Y is } N(\mu, \sigma) \text{ and Z is } N(0, 1). \qquad (4.8.6)$$


Example 4.8.6 

In the population to which applies the SAT score Y, which is N(500, 50), 
what proportion have scores below 560? We need only translate a statement 
about Y into a statement about Z, and then use Table A-3: 




$$P(Y < 560) = P\left(\frac{Y - 500}{50} < \frac{560 - 500}{50}\right) = P\left(Z < \frac{60}{50}\right) = P(Z < 1.2) = .8849.$$
Thus 88.49 percent of the population have SAT scores lower than 560. 


Example 4.8.7 
In the above population, what proportion have scores 425 or less? What 
proportion have scores over 600? 
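$$P(Y \le 425) = P\left(\frac{Y - 500}{50} \le \frac{425 - 500}{50}\right) = P(Z \le -1.5) = .0668.$$

Thus 6.68 percent of the population have scores 425 or less.

$$P(Y > 600) = P\left(\frac{Y - 500}{50} > \frac{600 - 500}{50}\right) = P\left(Z > \frac{100}{50}\right) = P(Z > 2.0) = 1 - P(Z \le 2.0) = 1 - .9773 = .0227.$$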


Thus, 2.27 percent of the population have scores above 600. 
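The translation from Y to Z in Examples 4.8.6 and 4.8.7 can be mirrored in a few lines of Python. This is a sketch under the assumption that the standard normal probabilities of Table A-3 may be replaced by the error function `math.erf` from the standard library; the function names are ours.

```python
import math


def standard_normal_cdf(z):
    """P(Z <= z) for the standard normal Z, computed from the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def normal_cdf(y, mu, sigma):
    """P(Y <= y) for Y that is N(mu, sigma), by standardizing as in (4.8.6)."""
    z = (y - mu) / sigma
    return standard_normal_cdf(z)


below_560 = normal_cdf(560, 500, 50)      # about .8849
at_most_425 = normal_cdf(425, 500, 50)    # about .0668
above_600 = 1 - normal_cdf(600, 500, 50)  # about .0228
```

The last figure differs from the table-based .0227 only in the fourth decimal place, because the table entry .9773 is itself rounded.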
* The algebra of inequalities is very much like the algebra of equalities, with one additional
wrinkle. We operate with the following in mind:

(a) An inequality is unchanged if a common quantity is added to, or subtracted from, each
member, for example,

      5 < 7                  5 < 7
      2 + 5 < 2 + 7          5 - 8 < 7 - 8
      7 < 9                  -3 < -1

(b) An inequality is unchanged if each member is multiplied or divided by a common positive
quantity, for example,

      5 < 7                  16 > 12
      2 × 5 < 2 × 7          16 ÷ 4 > 12 ÷ 4
      10 < 14                4 > 3

(c) An inequality has its sense (direction) reversed if each member is multiplied or divided by a
common negative quantity, for example,

      5 < 7                  16 > 12
      (-2) × 5 > (-2) × 7    16 ÷ (-4) < 12 ÷ (-4)
      -10 > -14              -4 < -3




Example 4.8.8 

In the same population as in the two preceding examples, what is the top 
score of the lowest 15 percent? We start with Table A-3, enter with area, read 
the z value and then work out y. 


[Figure: standard normal curve with area .15 shaded to the left of z = -1.036.]

$$.15 = P(Z < -1.036) = P\left(\frac{Y - 500}{50} < -1.036\right) = P(Y - 500 < 50(-1.036)) = P(Y < 500 - 51.8) = P(Y < 448.2).$$

Thus the lowest 15 percent of SAT scores in the population are 448 and below.


Example 4.8.9 
In the above population, what are the scores for the top 5 percent? 


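[Figure: standard normal curve with area .9500 to the left of z = 1.645 and area .05 to the right.]

$$.05 = P(Z > 1.645) = P\left(\frac{Y - 500}{50} > 1.645\right) = P(Y - 500 > 50(1.645)) = P(Y > 500 + 82.25) = P(Y > 582.25).$$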
Thus the top 5 percent of the population have SAT scores of 582 and over. 
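Working from an area back to a score, as in Examples 4.8.8 and 4.8.9, amounts to reading the normal table in reverse. A minimal sketch, assuming Python 3.8 or later, uses `statistics.NormalDist` and its `inv_cdf` method in place of Table A-3; the variable names are illustrative.

```python
from statistics import NormalDist

# SAT scores assumed N(500, 50), as in the examples above.
sat = NormalDist(mu=500, sigma=50)

# Score below which the lowest 15 percent fall (Example 4.8.8).
top_of_lowest_15 = sat.inv_cdf(0.15)   # about 448.2

# Score above which the top 5 percent fall (Example 4.8.9):
# 95 percent of the area lies below it.
bottom_of_top_5 = sat.inv_cdf(0.95)    # about 582.2

# The same cutting point found the long way, y = mu + sigma * z.
z_95 = NormalDist().inv_cdf(0.95)      # about 1.645
by_hand = 500 + 50 * z_95
```

Both routes give the same score, since `inv_cdf` on N(500, 50) is just the z value rescaled by $y = \mu + \sigma z$.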




In Chapter 3 we considered the usefulness of the median as a descriptive 
measure of central tendency for a collection of data. For such a set of 
observations the median is the “middle” observation, or the mean of the two 
“middle” observations, in order of magnitude; it is thus a scale value that is 
exceeded by as many observations as it exceeds. With respect to a random 
variable Y and its probability distribution, we translate this notion into 
probability terms, just as we translated the notions of mean and standard 
deviation. We would like to define the median of Y as that y value that is as 
likely to be exceeded as not. 


In almost every case where Y is a discrete random variable, there is trouble 
in trying to identify a median by the above criterion, because the probability 
accumulates by jumps. Consider the random variable about the point rolled 
with a pair of dice. In Section 4.6 we set down the entire probability 
distribution of that Y. The following table shows what happens when we look 
for a y value that is as likely to be exceeded as not. For each y value we put in 
one column the total probability up to and including that y and in another 
column the probability of larger values of y; then we look for a y-value where 
the two entries are the same. 


  y     Probability of y         Probability of
        and smaller values       larger values
  1     0                        1
  2     1/36                     35/36
  3     3/36                     33/36
  4     6/36                     30/36
  5     10/36                    26/36
  6     15/36                    21/36
  7     21/36                    15/36
  8     26/36                    10/36
  9     30/36                    6/36
  10    33/36                    3/36
  11    35/36                    1/36
  12    1                        0


We see that there is no y value that is as likely to be exceeded as not. Hence 
we must look for some kind of conventional rules by which we can select some 
number as a reasonable measure of “break-even” point. We do not want to 
enter that jungle in this book, and so we shall restrict ourselves to continuous 
random variables when we consider the median, or other scale values for 
splitting the probability distribution into different ratios. 
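The search for a break-even point in the dice table can be repeated mechanically. The sketch below, which assumes fair dice and uses names of our own choosing, accumulates the probabilities and looks for a y value whose two entries are equal.

```python
from fractions import Fraction

# Probability function of the point Y rolled with a pair of fair dice.
f = {y: Fraction(6 - abs(y - 7), 36) for y in range(2, 13)}

rows = []
cumulative = Fraction(0)
for y in range(1, 13):
    cumulative += f.get(y, Fraction(0))        # probability of y and smaller values
    rows.append((y, cumulative, 1 - cumulative))  # last entry: larger values

# A median in the strict sense would have both entries equal to 1/2.
break_even = [y for y, at_or_below, above in rows if at_or_below == above]
print(break_even)  # []
```

The empty list confirms that the cumulative probability jumps from 15/36 at y = 6 to 21/36 at y = 7 without ever landing on 1/2.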




When Y is a continuous random variable, its entire probability mass is 
represented by the area under a continuous curve, and so we can precisely find 
a y value to cut the area at any proportion we want. It is of interest to define a
whole set of cutting points for giving percentages of the total area. Such cutting 
points are called percentiles. 


[Figure: probability density function f(y) of Y, with a percentile cutting point marked on the y axis.]


Thus the median of Y is the 50th percentile $y_{.50}$, since

$$P(Y \le y_{.50}) = .50 = 50 \text{ percent},$$

whence

$$P(Y > y_{.50}) = 1 - P(Y \le y_{.50}) = 1 - .50 = .50 = 50 \text{ percent},$$

and, since Y is continuous,

$$P(Y \le y_{.50}) = P(Y < y_{.50}),$$

so that we have

$$P(Y < y_{.50}) = .50 = P(Y > y_{.50}).$$


In some areas of application, special names are given to certain classes of 
percentiles, according to the following scheme: 


$y_{.25}, y_{.50}, y_{.75}$: first, second, third quartile;
$y_{.20}, y_{.40}, y_{.60}, y_{.80}$: first, second, third, fourth quintile;
$y_{.10}, y_{.20}, \ldots, y_{.90}$: first, second, ..., ninth decile.


In this array we can note that the median is the 50th percentile or the second 
quartile or the fifth decile. 




Example 4.8.10 Normal distributions. 


Table A-3 gives immediately the percentile rank of the listed z values in the
standard normal distribution: we have only to read $P(Z \le z)$ and move the
decimal place. Thus for z = 1.3 we read $P(Z \le 1.3) = .9032$, and so 1.3 is the
90.32th percentile of Z. Granted, it is not very practical to read this out loud!
We can try “ninety point thirty-twoth percentile,” but it is perhaps better left
unsaid, relying on statements like “somewhat higher than the ninetieth percentile”
or “between the ninetieth and ninety-first percentiles.”


For a general normal distribution we can find percentile rankings by translat- 
ing normal Y into standard normal Z, as we did earlier. Take the example of 
SAT scores which are N(500, 50). What percentile is the score 525?

$$P(Y \le 525) = P\left(\frac{Y - 500}{50} \le \frac{525 - 500}{50}\right) = P\left(Z \le \frac{25}{50}\right) = P(Z \le 0.5) = .6915.$$

Thus 525 is just above the 69th percentile.


We identify z values or y values for specified percentiles by running the 
above procedures in reverse. From Table A-3 we see that the 20th percentile 
of the standard normal distribution is —0.842, since we can enter the table 
where P(Z<z) lists .2000 and read out z=—0.842. Similarly the 90th 
percentile of Z is 1.282, the 5th percentile is -1.645, and the median (50th
percentile) is 0.

[Figure: standard normal curve with area .20 to the left of z = -0.842.]


Translation from Z to Y will carry any such argument to the percentiles of
any given general normal distribution. Again take the N(500, 50) SAT scores as
example. What is the 20th percentile of this distribution?

$$.20 = P(Z < -0.842) = P\left(\frac{Y - 500}{50} < -0.842\right) = P(Y - 500 < -42.1) = P(Y < 457.9).$$




Thus the 20th percentile of the distribution is approximately the score 458. 
Similarly 


$$.90 = P(Z \le 1.282) = P\left(\frac{Y - 500}{50} \le 1.282\right) = P(Y - 500 \le 64.1) = P(Y \le 564.1)$$

$$.05 = P(Z \le -1.645) = P\left(\frac{Y - 500}{50} \le -1.645\right) = P(Y - 500 \le -82.25) = P(Y \le 417.75)$$

show that the 90th percentile is approximately 564 and the 5th percentile
approximately 418.


The following diagram is a frequently used description of the normal 
distribution. We start with the standard normal curve, take cutting points at 
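$z = \pm 1, \pm 2, \pm 3$, and use Table A-3 to get the indicated areas (probabilities).
Using the relation (4.8.6):

$$\frac{Y - \mu}{\sigma} = Z, \quad \text{when Y is } N(\mu, \sigma),$$

we can convert the z scale to a y scale for the general normal random variable,
since the above equation gives

$$\frac{y - \mu}{\sigma} = z, \quad \text{or} \quad y = \mu + \sigma z.$$

[Figure: normal curve with the z scale (-3 to 3) aligned with the y scale ($\mu - 3\sigma$ to $\mu + 3\sigma$), showing the central areas within one, two, and three standard deviations of the mean; the area within three standard deviations is 99.74%.]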


It is because of the facts in this diagram that one frequently hears a
statement such as “in a normal population, approximately 68 percent of all of
the elements are within one standard deviation of the mean, about 95 percent
of them are within two standard deviations of the mean, and all but about one
fourth of one percent of them are within three standard deviations of the
mean.” Here we should keep in mind that “normal” means normal as defined
by the normal distribution. And if “population” is a population of physical
elements like people or television tubes or soap bars, then the normal distribution
can be only an approximation, since any collection of a finite number of
elements has a histogram for its distribution graph, as we saw in Chapter 2. To
say that such a collection has a “normal distribution” makes sense only if the
collection is so large and so distributed that its histogram if plotted on a fine
mesh of intervals is virtually indistinguishable from the normal distribution
curve.
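The three central areas quoted in that statement can be recomputed directly. The sketch below assumes Python 3.8 or later and uses `statistics.NormalDist` for the standard normal distribution; the dictionary name is our own.

```python
from statistics import NormalDist

z = NormalDist()  # standard normal, N(0, 1)

# Area between -k and +k standard deviations of the mean, for k = 1, 2, 3.
central_area = {k: z.cdf(k) - z.cdf(-k) for k in (1, 2, 3)}

for k, area in central_area.items():
    print(k, round(area, 4))
# prints approximately: 1 0.6827, 2 0.9545, 3 0.9973
```

The small differences from figures read off a four-place table (for example 99.74 percent against 99.73 percent) come only from rounding in the table entries.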


EXERCISES 


4.8.1 If the random variable Y has the following probability distribution, what are
the mean ($\mu$) and standard deviation ($\sigma$) of Y?


y f(y) = P(Y = y)


1 0.20 
2 0.40 
6 0.30 
12 0.10 


4.8.2 Calculate the mean and variance for each of the following random variables:

  (a)                             (b)
  y    f(y) = P(Y = y)            y              f(y) = P(Y = y)
  0    1/8                        [illegible]    .25
  3    1/2                        [illegible]    .30
  6    1/8                        0              .05
  7    1/4                        3              .10
                                  5              .30


4.8.3 In Exercise 4.3.2 you calculated the probabilities for all possible numbers of
heads in four tosses of a balanced coin. You thus constructed the probability
distribution for the binomial random variable Y = the number of heads in four
tosses (the parameters being n = 4 and p = 1/2). Use your results from Exercise
4.3.2 to calculate the mean ($\mu$) and variance ($\sigma^2$) of Y by the formulas for $\mu$
and $\sigma^2$ given by (4.8.1) and (4.8.3). Check your results by showing agreement
with (4.8.5): the binomial distribution has $\mu = np$, $\sigma^2 = np(1 - p)$.

4.8.4 For the random variable Y having the binomial distribution with n = 10 and
p = 0.4, calculate the mean ($\mu$) and variance ($\sigma^2$) by the formulas for $\mu$ and $\sigma^2$ in
(4.8.1) and (4.8.3), using the probabilities derived from Table A-2. Check the
results against the formulas in (4.8.5): $\mu = np$, $\sigma^2 = np(1 - p)$.




4.8.5 Let Y be a random variable which follows a normal distribution with mean
$\mu = 20$ and standard deviation $\sigma = 2$. What is the probability that a random
observation taken from this population would be: (a) < 21.0, (b) < 18.6, (c)
> 23.2, (d) between 17.4 and 22.4?

4.8.6 The average seasonal rainfall in Town A is 50 centimeters, with a standard
deviation of 15 centimeters. Assume seasonal rainfall to be normally distributed.
In a 75-year period, how many years would you expect to have between
38 and 65 centimeters of rain? How many drought years would you expect, if
drought is defined as seasonal rainfall of 29 centimeters or less?

4.8.7 In a certain bakery process, the heights of cakes baked under standard
conditions have a normal distribution with mean $\mu = 110$ millimeters and
standard deviation $\sigma = 10$ millimeters.

a. What is the probability that a cake from this population will be taller than
135 mm?

b. If 10,000 cakes were baked, how many of them would you expect to be: (a)
higher than 122 millimeters, (b) lower than 100 millimeters?

4.8.8 The acceptability of a capillary tube for a freezer is found by measuring the
pressure drop in pounds per square inch between the two ends of the tube. The
pressures obtained from a manufacturing process of capillary tubes show an
average of 130 pounds per square inch and a standard deviation of 4 pounds
per square inch. Assume that these pressures are random and normally
distributed. Determine what: (a) percent of the pressures are below 121.6
pounds per square inch, (b) percent of the pressure readings lie between 121.6
and 134.4 pounds per square inch, (c) value is exceeded by 75 percent of the
pressure readings, and (d) limits include the middle 90 percent of the pressure
readings.

4.8.9 Suppose that in a certain population the true mean systolic blood pressure is
125 (millimeters of mercury) and the standard deviation of the distribution of
individual blood pressures is 9. The distribution is considered to be normal.

a. What proportion of the population has systolic blood pressure between 110
and 135?

b. What percentage of the population has systolic blood pressure in the ranges
125 ± 9, 125 ± 18, and 125 ± 27, respectively?

c. What proportion of the population has systolic blood pressure: higher than
145? as low as 100?

d. If it is desired to classify in some special way (say Category C) the 5 percent
of the population having the highest blood pressure, what is the criterion, in
terms of blood-pressure reading, for putting an individual into Category C?

4.8.10 Suppose that health authorities wanted to impose a 95 percent effective
quarantine on persons bitten by the Anopheles mosquito. (We assume a normal
distribution of incubation periods with $\mu = 14$ days and $\sigma = 2$ days.) They want
to determine times $y_1$ and $y_2$ after exposure such that only 2.5 percent of cases
will develop malaria before $y_1$ and only 2.5 percent of cases will come down
after $y_2$. Then the quarantine can extend from $y_1$ to $y_2$ days after exposure.
Determine the quarantine period. Determine a 99.9 percent effective quarantine
period.




4.9 NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION


In our discussion of the binomial distribution in Sections 4.5 and 4.8, and in 
the examples considered there, it must have occurred to the reader that any 
reasonably large number n of trials would require either enormous tables or 
enormous calculations, or both, to produce the probabilities of various num- 
bers of successes. This is in fact the case, and some of the earliest research in 
probability theory was directed to finding convenient yet close approximations 
to the exact probabilities. 

The most successful of these approximation attempts led directly to the 
standard normal distribution. Recall the nature of the binomial distribution. 
The random variable Y can be looked on as the number of “successes” in n 
independent trials of an experiment wherein the probability of success in a
single trial is p. As set forth in (4.8.5), the mean $\mu$ and standard deviation $\sigma$ for
this Y are:

$$\mu = np, \qquad \sigma = \sqrt{np(1 - p)}.$$


The pertinent theorem states that the “standardized” or “normalized” version
of Y, namely,

$$\frac{Y - \mu}{\sigma},$$

that is to say,

$$\frac{Y - np}{\sqrt{np(1 - p)}},$$

has a distribution which becomes more and more nearly like the standard
normal distribution as n gets larger and larger. We say that the fraction is
asymptotically normal, meaning thereby that

$$\text{Distribution of } \frac{Y - np}{\sqrt{np(1 - p)}} \longrightarrow N(0, 1) \text{ as } n \to \infty. \qquad (4.9.1)$$
In practical use, we stop far short of infinity and take N(0,1) as an 
approximation to the distribution of the indicated fraction. The goodness 
(closeness) of the approximation depends on both n and p. Experience has 
shown the approximation satisfactory in all cases where both np and n(1—p) 
are greater than 5. 




We summarize all the foregoing in the procedural rule: 


If Y is binomial with parameters n and p
such that np > 5 and n(1 - p) > 5, then

$$\frac{Y - np}{\sqrt{np(1 - p)}} \approx Z, \qquad (4.9.2)$$

where Z is standard normal N(0, 1).


Example 4.9.1 


A baseball player who has a .300 batting average would be "expected" to 
have six hits in 20 turns at the plate. Assuming actuality to satisfy the 
conditions for the binomial distribution (independent trials, constant hit proba- 
bility), what is the probability that he will get no more than six hits in the 20 
times at bat? 


Taking Y = the number of hits, we have Y binomial with n = 20, p = .30, and
we want $P(Y \le 6)$. From Table A-2, the exact probability (to four decimal
places) is 0.6080.


Since here np = 20(.3) = 6, n(1 - p) = 20(.7) = 14, and the values 6 and 14 are
both greater than 5, the approximation (4.9.2) is justified, and we could argue:

$$\mu = np = 20(.3) = 6, \qquad \sigma = \sqrt{np(1 - p)} = \sqrt{20(.3)(.7)} = \sqrt{4.20} = 2.05;$$

$$P(Y \le 6) = P\left(\frac{Y - 6}{2.05} \le \frac{6 - 6}{2.05}\right) \approx P(Z \le 0) = .5000,$$

the final probability figure coming from the standard normal distribution (Table
A-3).


So our approximation gives the answer .5000 to a question whose exact 
answer is .6080. Not terribly bad, considering the relatively small value of n 


and of np. But the approximation can be greatly improved by a small adjust- 
ment. 


The adjustment made when applying the approximation (4.9.2) to a specific 
case is often termed a correction for continuity. Both the name and the method 
of making the correction are clarified by observing the motivation for the 
correction. The standard normal distribution is represented by a continuous 
curve, while the binomial distribution is represented by a histogram. Our 
approximation of a binomial probability is thus an approximation of an area 
made up of rectangles by an area under the continuous normal curve. Thus by 


reference to the diagram, we see the reasonableness of a “correction” to
improve the approximation: 




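[Figure: binomial histogram overlaid with the approximating normal curve, with $y_1$ and $y_2$ marked on the horizontal axis.]

$$P(Y \le y_1) \approx P\left(Z \le \frac{(y_1 + \frac{1}{2}) - np}{\sqrt{np(1 - p)}}\right)$$

$$P(Y \ge y_2) \approx P\left(Z \ge \frac{(y_2 - \frac{1}{2}) - np}{\sqrt{np(1 - p)}}\right) \qquad (4.9.3)$$

$$P(y_1 \le Y \le y_2) \approx P\left(\frac{(y_1 - \frac{1}{2}) - np}{\sqrt{np(1 - p)}} \le Z \le \frac{(y_2 + \frac{1}{2}) - np}{\sqrt{np(1 - p)}}\right)$$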

The correction for continuity is the adjustment 1/2 made in the manner shown 
in the foregoing three statements. 


Example 4.9.1a 

We improve the approximation in our calculation of probability of no more 
than six hits by using a small rough sketch to show us how to adjust the 
calculation in accordance with the correction for continuity. We want to
approximate that area of the histogram that is indicated by shading, and so we
see that the area under the approximating curve should be taken to the
boundary 6.5. Then we calculate:

$$P(Y \le 6) = P\left(\frac{Y - 6}{2.05} \le \frac{6 - 6}{2.05}\right) \approx P\left(Z \le \frac{6.5 - 6}{2.05}\right) = P\left(Z \le \frac{.5}{2.05}\right) = P(Z \le 0.24).$$

Table A-3 shows the answer to be between .5793 and .6000. (Taking $0.5/\sqrt{4.20} =
0.5/2.049 = .2440$ and using more elaborate standard normal tables, we would
have the answer .5964.) This is now very close to the exact answer .6080.
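The three figures in this example (the exact binomial probability, the plain normal approximation, and the approximation with the correction for continuity) can be compared in one short computation. This is a sketch assuming Python 3.8 or later; the variable names are ours, and the exact value is summed from the binomial probability function rather than read from Table A-2.

```python
import math
from statistics import NormalDist

n, p = 20, 0.30
mu = n * p                           # 6
sigma = math.sqrt(n * p * (1 - p))   # sqrt(4.20), about 2.05

# Exact P(Y <= 6): add the binomial probabilities for y = 0, 1, ..., 6.
exact = sum(math.comb(n, y) * p**y * (1 - p) ** (n - y) for y in range(0, 7))

z = NormalDist()  # standard normal

# Approximation (4.9.2) without correction: boundary taken at 6.
uncorrected = z.cdf((6 - mu) / sigma)

# With the correction for continuity: boundary moved out to 6.5.
corrected = z.cdf((6.5 - mu) / sigma)

print(round(exact, 4), round(uncorrected, 4), round(corrected, 4))
# prints: 0.608 0.5 0.5964
```

Moving the boundary by one half closes most of the gap between .5000 and the exact .6080.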




Example 4.9.2 

In a certain kind of animal mating, the probability is 75 percent that the 
offspring will have normal coats. What is the probability that 76 independent 
matings will yield at least 50 offspring with normal coats? 


Take Y = number of offspring with normal coats, and consider Y to be 
binomial with n = 76 and p = 3/4. Since np = 76(3/4) = 57, n(1 - p) = 76(1/4) =
19, and these values are both greater than 5, the approximation (4.9.2) is
applicable.

$$\mu = np = 76(3/4) = 57,$$
$$\sigma = \sqrt{np(1 - p)} = \sqrt{76(3/4)(1/4)} = \sqrt{14.25} = 3.77;$$

$$P(Y \ge 50) = P\left(\frac{Y - 57}{3.77} \ge \frac{50 - 57}{3.77}\right) \approx P\left(Z \ge \frac{49.5 - 57}{3.77}\right) \quad \text{(see diagram below)}$$

$$= P\left(Z \ge \frac{-7.5}{3.77}\right) = P(Z \ge -1.99).$$

Taking one decimal place in z so as to match Table A-3, we have

$$P(Y \ge 50) \approx P(Z \ge -2.0) = 1 - P(Z < -2.0) = 1 - .0227 = .9773.$$


The exact probability, to four decimal places, as given in extensive tables of the 
binomial distribution, is .9735. 


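[Figure: histogram bars at 49, 50, 51, ... up to the mean 57, with the approximating normal curve; the approximating area begins at the boundary 49.5.]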


Example 4.9.3 


In Section 4.3 we reported on two experiments of rolling a pair of dice 504

times, keeping track of the cumulative proportion of rolls that gave point “7.” 
We commented that 504 rolls were not nearly enough to give high likelihood 
of ending up with the proportion being within .01 of the theoretical probability 
1/6. What is the probability of being within .01 of 1/6 when we observe the 
proportion of 7s in 504 rolls? 




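To be within .01 of 1/6 when we calculate the observed proportion of 7s
means that, if Y = the number of 7s in 504 rolls, we have

$$\frac{1}{6} - .01 \le \frac{Y}{504} \le \frac{1}{6} + .01;$$

that is, multiplying all members by 504, we require:

$$84 - 5.04 \le Y \le 84 + 5.04,$$
$$78.96 \le Y \le 89.04.$$

Since the number of 7s has to be a whole number, the boundaries on Y
consistent with the above are

$$79 \le Y \le 89.$$

Taking Y as binomial with n = 504 and p = 1/6, we have

$$\mu = np = 504(1/6) = 84,$$
$$\sigma = \sqrt{np(1-p)} = \sqrt{504(1/6)(5/6)} = \sqrt{70.0} = 8.37;$$

$$
\begin{aligned}
P(79 \le Y \le 89) &= P\!\left(\frac{79 - 84}{8.37} \le \frac{Y - 84}{8.37} \le \frac{89 - 84}{8.37}\right) \\
&\approx P\!\left(\frac{78.5 - 84}{8.37} < Z < \frac{89.5 - 84}{8.37}\right) \\
&= P\!\left(\frac{-5.5}{8.37} < Z < \frac{5.5}{8.37}\right) \\
&= P(-0.7 \le Z \le +0.7) = .7580 - .2420 = .5160.
\end{aligned}
$$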
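The two-sided calculation can be reproduced without rounding z to one decimal. This is a rough sketch with illustrative helper names, not part of the original text; it compares the exact binomial sum, the normal approximation with the continuity correction at full precision, and the table-style value obtained by rounding z to ±0.7.

```python
from math import comb, erf, sqrt


def normal_cdf(z: float) -> float:
    """Standard normal cumulative probability P(Z < z)."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))


def interval_exact(lo: int, hi: int, n: int, p: float) -> float:
    """Exact binomial P(lo <= Y <= hi)."""
    return sum(comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(lo, hi + 1))


def interval_normal(lo: int, hi: int, n: int, p: float) -> float:
    """Normal approximation to P(lo <= Y <= hi) with the 1/2 correction."""
    mu = n * p
    sigma = sqrt(n * p * (1 - p))
    return normal_cdf((hi + 0.5 - mu) / sigma) - normal_cdf((lo - 0.5 - mu) / sigma)


n, p = 504, 1 / 6
exact = interval_exact(79, 89, n, p)
approx = interval_normal(79, 89, n, p)
table_style = normal_cdf(0.7) - normal_cdf(-0.7)   # z rounded to one decimal

print(f"exact binomial sum        : {exact:.4f}")
print(f"normal, z = 5.5/8.37      : {approx:.4f}")
print(f"normal, z rounded to 0.7  : {table_style:.4f}")
```

Rounding z = 0.657 up to 0.7 to match a one-decimal table raises the answer by a few hundredths, so the figure .5160 should be read as a table-limited approximation.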




EXERCISES 


4.9.1 In the case of certain animals, the probability that an offspring is pure white is 25
percent. What is the probability that among 300 offspring, the number of
pure white will be: (a) no more than 67, (b) more than 90?


4.9.2 According to the latest published mortality tables, about 23 percent of persons 60
years of age die before reaching age 70. What is the probability that of a group
of 2000 persons of age 60 the number of deaths before 70 will be at least 450
and not more than 500? What would you say about the group if the number of
deaths actually turned out to be 420?


4.9.3 If a type of surgical operation having a 90 percent probability of success is
performed 100 times, what is the probability that there will be more than 11
failures?


4.9.4 It is believed that 20 percent of the voters in a certain community are Independent
voters. A poll is taken of 400 voters constituting a random sample. Of
these, 95 state that they are Independent. What is the probability of a response
as large as this if the 20 percent proportion is correct?


4.9.5 With the standard process of manufacturing a certain article, 2 percent of the
produced units are defective. A new time- and money-saving process will be
installed if it does not significantly increase this proportion of defectives. A test
run of 2500 units produced by the new process shows 55 defective. What is the
probability of a number of defectives as large as this if the 2 percent defective
rate has been maintained?


4.9.6 The Mecca College basketball team has a long-run record of 60 percent wins.
What is the probability that it will not do better than break even in next season's
24 games?


4.9.7 Repeat the argument of Example 4.9.3 using 5400 rolls of the dice, and thus
confirm the statement made in Section 4.3 that "something over 5000 rolls are
required to give 95 percent probability that the observed proportion (of 7s) will
be within .01 of 1/6."


Educated Guessing


5.1 THE USE OF A RANDOM SAMPLE


When we use the data of a sample to tell us something about the population
from which the sample was taken, the only thing we can be sure of is that we
do not have the whole truth about that population. A frequency distribution or
a histogram of the y values in the sample can give us a good idea about the
probability distribution of the characteristic Y in the population, but it
cannot give us the exact distribution of Y. The mean ȳ of the sample gives us
an estimate of the population's mean μ, but ȳ can never give us μ precisely.
The same is true of the sample standard deviation s: s is never σ; it is an
estimate of σ.

In such a state of logical ignorance, the best we can do is to hedge our
estimates in some scientific manner that takes chance into account. Then at
least we can have a high degree of confidence that the truth lies within
certain specified limits. This is the purpose of that part of statistical
inference called statistical estimation. It is one reason that statistical
inference is sometimes defined as decision-making in the face of uncertainty.

When a gambler places a bet on the roll of a pair of dice or on the outcome of
a horse race, he is making a decision based on the odds assumed to hold for the
various outcomes, a judgment based on the probability distribution of the
chance process. So it is with a statistician making an estimate about a
population; he makes a decision based on the probability pattern assumed to
govern sample estimates.


In one important sense the statistician is worse off than the gambler. After 
the dice are rolled or after the horse race is run, the gambler knows exactly 
whether his decision was right or wrong. In most cases the statistician can never 
know whether his hedged estimate is right or wrong. Even if he estimates voter 
preferences, the actual outcome of the election will not tell him whether he was 
right or wrong at the time of his sampling. He must rely on an estimation
procedure that has a very high probability of giving a correct bracket each time 
it is used with a single sample. 


Mathematicians who specialize in the probability theory underlying samples 
have worked out the probability patterns governing estimates in a wide variety 
of situations. In all cases of practical usefulness, these patterns are based on 
having the sample be a random sample. 


We present a method for drawing a sample which in most practical situations 
will satisfy the requirements. The important thing to keep in mind is that our 
methods of educated guessing about the limits to put around an estimate will 
be valid only if the estimate comes from a random sample. 


5.2 DEFINITION OF A RANDOM SAMPLE 


In order to get an observation y on the random variable Y, we operate the 
chance process that governs the distribution of Y and observe the result. We 
roll the dice and observe the resulting point. We toss the coins and observe the 
number of heads. We inspect a television tube taken from a production line 
and see whether it is defective or nondefective. We take a capsule from the 
military draft lottery bowl of birthdates and read the enclosed date. We ask a
laboratory to analyze a vial of blood and discover the cholesterol content. We
select a student file and read the G.P.A.


terms, this means that the single trial that we conduct must be just as likely as 
any other trial that we could conduct. 


A fair roll of a pair of dice or a fair toss of three coins can reasonably be
assumed to meet the above requirement. For a production line of television
tubes, the bowl of birthdate capsules in the draft lottery, a collection of vials of
blood, or a cabinet of student files, we satisfy the requirement by using some
selection procedure that makes each element (tube, capsule, vial, or file) as
likely to be drawn as every other element.




A random observation made in such a manner is a random variable: its value 
is unknown before the trial is made, and the value that appears in the trial is 
determined by the probability distribution of the population random variable 
Y. The random observation is a Y. If it is our first observation in a series of
observations, we can show this by subscripting: Y₁. After the trial we know the
outcome and call it y₁.


If a second observation is made in the same manner as the first, it is another
Y random variable; we can call it Y₂. Y₁ and Y₂ are called independent
observations if the outcomes of the first and second trials are independent
events in the sense mentioned in Chapter 4.


We now extend the process to the third, fourth, ..., nth observations. In this
way we have n independent random observations Y₁, Y₂, ..., Yₙ, each governed
by the probability distribution of Y. The outcomes of the n independent
trials are the observed numerical values y₁, y₂, ..., yₙ. Such a set of random
observations is called a random sample of size n from the population of Y.


DEFINITION 5.2.1 A random sample of size n from the population of
Y is a set of n independent random variables Y₁, Y₂, ..., Yₙ, each
having the probability distribution of Y.


All the foregoing discussion presupposes that the number of possible observations
is theoretically unlimited. This is true of dice rolls or coin tosses. It
cannot be true if the trial involves a distinct member of a finite collection. 
Television tubes, draft-lottery capsules, vials of blood, and student files are not 
infinite in number. Here we get into complications that require advanced study 
in sampling. In such situations one talks about a random sample from a finite 
population and defines that as a set of n elements drawn from the finite 
population in such a way that each possible set of n is as likely as every other 
possible set of n. When n is a very small fraction of the total number of 
available elements, the sample behaves in a manner very close to that of our 
definition above. In this book we shall limit ourselves to such cases. 




5.3 DRAWING A RANDOM SAMPLE FROM A 
FINITE POPULATION 


Our intuition gives us a reasonably good idea of how to draw n items at 
random from a collection of items: label the items in some identifiable manner,
put the labels in a box, shake the box energetically, and then draw out n labels 
"at random." But those last two words are more easily said than done. The 
hand grabbing can have a bias for the top of the pile, the bottom of the pile, 
this or that side of the box, or a pattern of spacing grabs. The more consciously 
we try to work out a system of random selections, the less likely it is to be 
random. 


Scientists realized long ago that it would be preferable to have a table of 
random numbers assembled from the most refined hand grabbing possible and 
tested for absence of patterns or other bias. Such a table could be an extremely 
long sequence of digits, each of which is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9, and 
each of which was the result of a random drawing wherein the 10 possibilities 
were equally likely. We could print this sequence of digits in columnar form, 
column after column, and on as many pages as necessary. Then any random 
start in the table would give us a random sequence of digits from there on. We
could read the columns two at a time to give us two-digit random numbers, 
three at a time for three-digit random numbers, and so on. 


A small-scale procedure for producing a random sequence of digits could be 
the following. In each of 100 capsules place a slip of paper marked 0, in each 
of 100 other capsules place a slip marked 1, in each of another 100 capsules 
place a slip marked 2, and so on by batches of 100 capsules until we have 1000 
capsules representing the digits 0-9 in equal proportions. Put the capsules in a 
large box, shake the box vigorously, and draw out a capsule. Record the digit 
in the capsule, replace the capsule in the box, shake the box again, and draw 
out a capsule. Record that digit, replace the capsule, shake the box, and draw 
out a capsule. Continue on and on as long as interest and strength allow. 
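The capsule procedure just described can be imitated in a few lines of code. The sketch below is only an illustration: it substitutes Python's pseudo-random generator for the physical shaking and drawing, and the function name `draw_digits` and the seed are hypothetical choices.

```python
import random
from collections import Counter


def draw_digits(count: int, seed: int) -> list[int]:
    """Draw `count` digits from a box of 1000 capsules, replacing each capsule."""
    rng = random.Random(seed)                 # stands in for shaking the box
    box = [digit for digit in range(10) for _ in range(100)]  # 100 capsules per digit
    return [rng.choice(box) for _ in range(count)]            # draw, record, replace


digits = draw_digits(10_000, seed=1976)
tally = Counter(digits)

print("first twenty digits:", digits[:20])
for digit in range(10):
    print(digit, tally[digit])    # each count should be roughly 1000
```

Because every capsule is returned before the next draw, each of the 10 digits is equally likely on every draw, which is the property the table of random digits is meant to have.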


The earliest published tables of random digits were constructed by some 
variation of the above physical process. After the invention of high-speed 
electronic computing machines, programs were devised to command such a 
computer to produce sequences of random digits. This is now the standard 
procedure for producing sequences of random digits. The best known of
published tables is A Million Random Digits, authored by the RAND Corpora- 
tion (Santa Monica, California) and published by the Free Press (Glencoe, 
Illinois) in 1955. The volume is often referred to as just the RAND Table. The 
sequence of 1,000,000 random digits is printed in 20,000 rows of 50 digits 
each, fifty rows to a page for 400 pages. One page of this table (chosen at 
random!) is reproduced as our Table A-6 in the Appendix. 




Rows and columns are printed in blocks of five for ease in reading. Row 
numbers are given at the left; column numbers must be counted off by the 
reader. Our Table A-6 is seen to contain rows 03000-03049 of the RAND 
Table; these appear on the RAND Table's page 61, the randomly chosen page. 
To check the numbering of columns, take row 03034 as example and note that 
the digit in column 1 is 8, the digit in column 2 is 1, the digit in column 3 is 8, 
the digit in column 4 is 0, and so on; the digit in column 37, for example, is 9. 


To use a table of random digits, one must take a random starting point in the 
table and then read a sequence from there. One may read across rows, going 
forward or backward; he may read along columns, down or up. To get a 
sequence of random numbers he may take the digits in blocks of two, three, 
four, and so on, taking as many digits as required to fit the largest item number he
has to accommodate. For example, if the largest number in the set from which 
we are going to draw is 27,654, then we are going to have to read digits in 
batches of five, so as to allow us to get numbers up through 27,654. Numbers 
with fewer digits will appear like 00218, 00006, and so on. 


There are various devices for choosing the random starting place. In Table 
A-6 a reasonable procedure is as follows. There are in this table 50 rows (row 
numbers ending 00-49) and 50 columns (we would count them off 1, 
2,...,50). So we have to choose at random a pair of numbers to determine 
our starting point, one number between 00 and 49 to fix the row, and one 
number between 1 and 50 to fix the column. With eyes closed, let a finger pick 
a point on the page. With eyes opened (!) take the five-digit block of numbers 
nearest your finger as a code number for the starting place. For example, 
consider what happened to one of the authors: his finger came nearest the 
block 57429 (it is the fifth block in row 03012). We take the first two digits of 
the five-digit block to tell us a row number, reducing by 50 if necessary to get a 
number between 00 and 49; in our example this gives us row 07 (57 minus 50). 
We take the next two digits in the block to give us a column number, again 
reducing by 50 if necessary to get a number between 01 and 50; in our 
example we get column 42 directly. The resulting row and column numbers 
then specify the starting place for our random sequence. In our example we 
have row 07, column 42. The reader should verify that the resulting sequence, 
if one reads down the column, starts out 4, 0, 9, 4, 4, 2, 3, 6,...; and if one 
reads left to right along a row, the sequence starts out 4, 2, 6, 8, 9, 2, 9, 5,
0,.... We pursue a sequence from one column to the next if we are reading by
columns and from one row to the next if we are reading by rows. Thus the
row-wise sequence above would be extended to read 4, 2, 6, 8, 9, 2, 9, 5, 0, 7,
2, 8, 1, 0, 9, ....
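The rule for turning a five-digit code block into a starting place can be written out as a small function. This is a sketch under stated assumptions: the name `starting_place` is hypothetical, and the handling of column digits "00" (treated here as column 50) is an assumption, since the text does not cover that case.

```python
def starting_place(code: str) -> tuple[int, int]:
    """Convert a five-digit code block into (row, column) for a 50-by-50 page.

    Row numbers run 00-49 and column numbers run 01-50, as in Table A-6.
    """
    if len(code) != 5 or not code.isdigit():
        raise ValueError("code must be a string of exactly five digits")

    row = int(code[:2])
    if row >= 50:            # reduce by 50 to land in 00-49
        row -= 50

    column = int(code[2:4])
    if column > 50:          # reduce by 50 to land in 01-50
        column -= 50
    if column == 0:          # assumption: '00' is read as column 50
        column = 50

    return row, column


print(starting_place("57429"))   # the block from the example in the text
```

For the block 57429 this reproduces row 07 and column 42; the fifth digit of the block is simply not used.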
Let us now use this procedure to select a random sample of nine students
from the 180 students considered in Chapter 1. The students are individually
identified by number, 1-180. We must therefore choose at random nine
numbers from the collection (1, 2, 3, ..., 180). This means that we must use
random digits in blocks of three, so that numbers up through 180 can appear.
There will of course be many random numbers which we will have to pass
over: any three-digit block giving a number larger than 180. We shall also
pass over any random number that we have already used.


Again permit the authors to finger the starting point code to set up the 
example. This time the code block turned out to be 40746 (eighth block in row 
03001). This gives us the start at row 40 and column 24 (74 minus 50).
Reading down a column seems to us the easiest way to read figures, and so we 
shall proceed that way. From the starting digit the sequence looks as follows on 
the left, and is read as shown to the right: 


98 0    980
83 5    835
31 0    310
81 8    818
17 2    172
08 9    089
81 1    811
66 1    661
77 1    771
15 8    158


Up to this point we have produced three usable numbers: 172, 089, and 158. 
We must continue on, starting at the top of the next column. There we read 


783, 149, 584, and so on. The complete sequence is as follows, with usable 
numbers starred. 


980 *158 458 465 328 
835 783 960 598 458 
310 *149 702 235 370 
818 584 300 980 552 
*172 282 288 356 670 
*089 401 686 *156 589 
811 *120 342 289 *074 
661 339 *017 987 
771 266 548 *109 




Thus our random sample of nine students is composed of students number 
172, 89, 158, 149, 120, 17, 156, 109, and 74. 
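The pass-over rule used above can be expressed compactly. The following sketch takes the three-digit blocks in the order they were read (down each column of the listing above) and keeps a number only if it lies between 1 and 180 and has not been used already; the function name `select_sample` is illustrative.

```python
def select_sample(blocks: list[str], population_size: int, sample_size: int) -> list[int]:
    """Keep usable, not-yet-used item numbers until the sample is complete."""
    chosen: list[int] = []
    for block in blocks:
        number = int(block)
        if 1 <= number <= population_size and number not in chosen:
            chosen.append(number)
            if len(chosen) == sample_size:
                break
    return chosen


# Three-digit blocks in reading order, copied from the listing in the text.
blocks = [
    "980", "835", "310", "818", "172", "089", "811", "661", "771",
    "158", "783", "149", "584", "282", "401", "120", "339", "266",
    "458", "960", "702", "300", "288", "686", "342", "017", "548",
    "465", "598", "235", "980", "356", "156", "289", "987", "109",
    "328", "458", "370", "552", "670", "589", "074",
]

sample = select_sample(blocks, population_size=180, sample_size=9)
print(sample)
```

Only 9 of the 43 blocks read are usable, which illustrates how many random numbers may have to be passed over when the largest item number is far below 999.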

Suppose our interest is in the G.P.A. We make the observations from the 
information in Chapter 1, and then can set down our sample as follows (with 
the y² column added for calculation of s):


Student     G.P.A.
Number        y          y²

  172        1.00       1.0000
   89        3.36      11.2896
  158        2.17       4.7089
  149        1.93       3.7249
  120        3.19      10.1761
   17        1.13       1.2769
  156        2.91       8.4681
  109        2.73       7.4529
   74        2.40       5.7600

Total       20.82      53.8574


The reader should confirm that this sample gives 


$$\bar{y} = \frac{20.82}{9} = 2.313; \qquad s = \sqrt{\frac{53.8574 - (20.82)^2/9}{8}} = 0.844.$$
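The confirmation asked of the reader can also be done by machine. This short sketch uses the nine G.P.A. values from the sample table and the shortcut formula with the sums of y and y², dividing by n − 1; the cross-check against `statistics.stdev` is an added convenience, not something the text relies on.

```python
from math import sqrt
from statistics import stdev

# G.P.A. values of the nine sampled students, in the order drawn.
gpa = [1.00, 3.36, 2.17, 1.93, 3.19, 1.13, 2.91, 2.73, 2.40]

n = len(gpa)
total = sum(gpa)                         # sum of y
total_sq = sum(y * y for y in gpa)       # sum of y squared

y_bar = total / n
s = sqrt((total_sq - total**2 / n) / (n - 1))   # shortcut formula, divisor n - 1

print(f"sum y = {total:.2f}, sum y^2 = {total_sq:.4f}")
print(f"sample mean = {y_bar:.3f}")
print(f"sample standard deviation = {s:.3f}")
print(f"library check (statistics.stdev) = {stdev(gpa):.3f}")
```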


The question now is: How close to the population mean μ and standard
deviation σ can we believe these values to be?


EXERCISES 


5.3.1 In Table A-6, confirm that the method described in the text for choosing a
random starting place in the table will give the first number to be read as shown
in the following examples.

      Starting Point    Number of Digits    First Number
      Code Number       per Block           to be Read

(a)   04187             3                   677
(b)   28351             2                   27
(c)   72224             3                   358
(d)   94452             4                   5202
(e)   13700             3                   229
(f)   02831             4                   8127
(g)   85790             3                   065
(h)   50662             2                   81




5.3.2 Using Table A-6, draw sets of random item numbers as specified in the following
examples.

      Item Numbers                   Required Size    Starting Point
      in Total Set                   of Sample        Code Number

(a)   1, 2, 3, ..., 7873             6                30170
(b)   1, 2, 3, ..., 4618             5                69033
(c)   1, 2, 3, ..., 659              7                47951
(d)   1201, 1202, 1203, ..., 7890    8                56704


5.3.3 Draw a random sample of 16 students from the Mecca Community College class
(Table 1.2.1). Set down the data on these 16 students, and give descriptive 
statistics as follows: (a) proportion female, (b) proportion having no party 
preference, (c) proportion in favor of legalizing marijuana, (d) mean scale score 
on the marijuana question, (e) mean commuting distance, (f) mean G.P.A. 
average, and (g) G.P.A. standard deviation. An interesting survey can be made 
by comparing your results with those of the other members of your class. 


5.3.4 A listing of 59 Accounts Receivable ledgers is shown in Table 5.3.1. The data
are the total dollar amounts, rounded to the nearest thousand of dollars, 
contained in each of the ledgers. 


a. Construct a frequency distribution of these accounts, using intervals of one
thousand dollars starting at 6,500. 


b. Construct a cumulative percentage frequency distribution. Using this distribu- 
tion, give the median dollar amount in the 59 ledgers. 


c. Calculate the mean dollar amount in the 59 ledgers. 


d. Using a table of random numbers, select two samples of size n = 9 each. For 
each sample, calculate its mean. 


e. When the exercise is finished, pool your two sample means with those of the
other class members and plot them on the same scale as used in (a) above. 


f. Compare the distribution in (a) with that found in (e). 




TABLE 5.3.1 Listing of Accounts Receivable
Ledgers in Ledger Number Order

Ledger  Amount     Ledger  Amount     Ledger  Amount

   1    11,000       21    13,000       41    11,000
   2    14,000       22     9,000       42     8,000
   3    10,000       23    12,000       43    11,000
   4    11,000       24    11,000       44    14,000
   5    10,000       25     8,000       45    11,000
   6    16,000       26     8,000       46     9,000
   7     9,000       27     9,000       47    10,000
   8    11,000       28    10,000       48    12,000
   9    10,000       29    10,000       49    12,000
  10     9,000       30     8,000       50    12,000
  11     9,000       31    10,000       51     9,000
  12    10,000       32    11,000       52     9,000
  13    11,000       33    10,000       53    10,000
  14     9,000       34    11,000       54     8,000
  15    13,000       35     9,000       55     9,000
  16    11,000       36     9,000       56     9,000
  17     8,000       37    11,000       57     7,000
  18     9,000       38    11,000       58    10,000
  19    11,000       39    12,000       59    10,000
  20     8,000       40    10,000


5.4 THE PROBABILITY DISTRIBUTION
OF A SAMPLE MEAN


Since the observations in a random sample are random variables, the mean
of those observations is a random variable. With our capital-letter convention,
we can write

$$\bar{Y} = \frac{1}{n}\sum Y_i = \frac{Y_1 + Y_2 + \cdots + Y_n}{n}.$$

When we draw a random sample of size n from the population of Y, we then
obtain the observed values y₁, y₂, ..., yₙ. Calculating the mean of these values
gives us the value of ȳ for the sample mean. This value of ȳ is thus the
outcome of one observation on the random variable Ȳ. Take another random
sample of size n from the population of Y and we shall end up with a ȳ value
different from the first. And so on. That is, there is a population of ȳ-values
just as there is a population of y-values. Thus we speak of the probability
distribution of the sample mean Ȳ.


With somewhat more mathematics than we want to use in this book, we 
could work out various important facts concerning the probability distribution 
of Ȳ. With still more advanced mathematical methods we could derive one of
the fundamental laws on which much of statistical inference is based. Here our 
interest is in applying such facts and laws, and so we shall simply state and use 




them, leaving derivations for the reader to consider if someday he makes a 
more detailed study of statistics. 


If Y has mean μ and standard deviation σ, then, no matter what is the
specific form of the probability distribution of Y, the probability distribution of
Ȳ has the following mean and standard deviation:

$$\text{Mean of } \bar{Y}: \quad \mu_{\bar{Y}} = \mu$$
$$\text{Standard deviation of } \bar{Y}: \quad \sigma_{\bar{Y}} = \frac{\sigma}{\sqrt{n}} \qquad (5.4.1)$$

Here we see two of the basic facts that make the sample mean so important
in statistics: the mean of the probability distribution of Ȳ is precisely the same
as the mean of the population of Y, and the standard deviation in the
distribution of Ȳ is smaller than that in the population of Y as soon as the
sample size n is larger than 1. This means that Ȳ varies about an expected
value that is the same as the expected value of Y, and its variation is smaller
than the variation in Y. Moreover this variation in Ȳ becomes less and less as
the sample size is made larger and larger. This makes intuitive sense, for we
would anticipate that an infinitely large sample should give a ȳ that would hit μ
on the nose, and this is indicated by the fact that σ/√n, measuring the variation
in Ȳ, goes to zero as n goes to infinity.
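The two facts in (5.4.1) can be watched in a simulation. The sketch below is an illustration only: the population is assumed to be the roll of one fair die (mean 3.5, standard deviation √(35/12)), the seed and the number of repetitions are arbitrary, and the helper name `sample_means` is hypothetical.

```python
import random
from math import sqrt
from statistics import fmean, pstdev


def sample_means(n: int, repetitions: int, rng: random.Random) -> list[float]:
    """Means of `repetitions` random samples, each of n fair-die rolls."""
    return [sum(rng.randint(1, 6) for _ in range(n)) / n for _ in range(repetitions)]


MU = 3.5                 # mean of one fair-die roll (assumed population)
SIGMA = sqrt(35 / 12)    # standard deviation of one fair-die roll

rng = random.Random(1976)    # fixed seed so the run can be repeated
results = {}
for n in (1, 4, 25):
    means = sample_means(n, 20_000, rng)
    results[n] = (fmean(means), pstdev(means))
    print(f"n = {n:2d}: mean of sample means = {results[n][0]:.3f}, "
          f"sd of sample means = {results[n][1]:.3f}, "
          f"sigma/sqrt(n) = {SIGMA / sqrt(n):.3f}")
```

In each row the observed mean of the sample means should sit near 3.5, while their observed standard deviation should track σ/√n and shrink as n grows.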


Information about the form of the probability distribution of Ȳ is contained
in the following two important theorems.


Theorem 5.4.1 


If Y has the normal distribution with mean μ and standard deviation σ,
then Ȳ has a normal distribution, with mean μ and standard deviation
(σ/√n). In shorthand notation,

$$\text{if } Y \text{ is } N(\mu, \sigma), \text{ then } \bar{Y} \text{ is } N\!\left(\mu, \frac{\sigma}{\sqrt{n}}\right).$$

The mean and standard deviation of Ȳ in this theorem are simply in
accordance with (5.4.1); the important new fact is that Ȳ has a normal
distribution if Y has.


The next theorem is one of the most important, and remarkable, laws in
probability theory. It shows that the distribution of Ȳ, as n is made larger and
larger, has a tendency to become more and more nearly a normal distribution
no matter what nonnormal distribution Y has. This law is called the central limit
theorem.




Theorem 5.4.2 (Central Limit Theorem). 


If Y has a nonnormal distribution with mean μ and standard deviation
σ, then, as n increases without bound, the distribution of Ȳ approaches
the normal distribution with mean μ and standard deviation (σ/√n).


Using the formula for transforming a normal random variable into the
standard normal random variable Z, we can summarize the above facts in the
following operational rule:

$$
\frac{\bar{Y} - \mu}{\sigma/\sqrt{n}}
\begin{cases}
= Z & \text{if } Y \text{ is normal}, \\
\approx Z & \text{if } Y \text{ is nonnormal and } n \text{ is large},
\end{cases}
\qquad (5.4.2)
$$

where Z has the standard normal distribution N(0, 1).


How large is “large” for n? Much effort has gone into the study of the 
closeness of the approximation for various values of n. There are a number of 
rules of thumb but no universally held standards, since so much depends on the 
specific form of the distribution of Y and on the criterion of “closeness” which 
one adopts for assessing the approximation. Figure 5.4.1 indicates how
rapidly the distribution of Ȳ approaches the normal; it suggests that n ≥ 30
is large enough to bring the distribution of Ȳ close to normal.


[Figure 5.4.1: for each of four populations, a panel showing the population distribution (values of y), followed by panels showing the sampling distribution of ȳ for several sample sizes, including n = 5.]

FIGURE 5.4.1 Distribution of the sample mean for various populations and sample sizes.
Adapted with permission from Kurnow et al., Statistics for Business
Decisions, Richard D. Irwin, Inc., Homewood, Illinois (1959), pp. 182-183.
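The drift toward normality can be examined by simulation. The sketch below is an illustration under assumptions of its own: it uses a strongly skewed exponential population with μ = σ = 1 (not one of the populations in Figure 5.4.1), an arbitrary seed, and a hypothetical helper `tail_fractions`. For a standard normal Z, 5 percent of values fall below −1.645 and 5 percent above +1.645, so the two tail fractions of the standardized sample mean should both move toward .05 as n grows.

```python
import random
from math import sqrt


def tail_fractions(n: int, repetitions: int, rng: random.Random, z: float = 1.645):
    """Fractions of standardized sample means falling below -z and above +z."""
    mu = sigma = 1.0                      # exponential population with rate 1
    below = above = 0
    for _ in range(repetitions):
        y_bar = sum(rng.expovariate(1.0) for _ in range(n)) / n
        standardized = (y_bar - mu) / (sigma / sqrt(n))
        if standardized < -z:
            below += 1
        elif standardized > z:
            above += 1
    return below / repetitions, above / repetitions


rng = random.Random(1976)    # fixed seed so the run can be repeated
tails = {}
for n in (2, 5, 30):
    tails[n] = tail_fractions(n, 20_000, rng)
    print(f"n = {n:2d}: lower tail = {tails[n][0]:.3f}, upper tail = {tails[n][1]:.3f}")
```

With n = 2 the lower tail is empty, because a mean of two positive values cannot fall that far below μ; by n = 30 both tails should be in the neighborhood of .05, though still visibly unequal for so skewed a population.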




5.5 A CONFIDENCE INTERVAL FOR μ
WHEN σ IS KNOWN


The relation (5.4.2) is a statement about the deviation of Ȳ from the
population mean μ. It shows that if we know σ we can study the probability
behavior of the deviation Ȳ − μ by using the standard normal distribution of Z.


When an investigator uses Ȳ as an estimator of μ, he would like to have
some bound on the magnitude of the deviation that has a high probability of
not being exceeded. He could hope, for example, that there is 95 percent
probability that the deviation will not exceed a certain reasonable amount. If
he gets that level of assurance, he could then have 95 percent confidence that
the ȳ which actually occurs in a sample is within the specified distance from μ.

The distribution of Z (Table A-3) tells us that the central 95 percent of the
probability of z-values is bounded by z = −1.96 and z = +1.96.


This z interval we can now translate into an interval concerning Ȳ and μ, by
using (5.4.2) and a bit of algebra. We pursue our probability statement as
follows.


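$$
\begin{aligned}
.95 &= P(-1.96 < Z < 1.96) \\
&= P\!\left(-1.96 < \frac{\bar{Y} - \mu}{\sigma/\sqrt{n}} < 1.96\right) \\
&= P\!\left(-1.96\frac{\sigma}{\sqrt{n}} < \bar{Y} - \mu < 1.96\frac{\sigma}{\sqrt{n}}\right) \\
&= P\!\left(-\bar{Y} - 1.96\frac{\sigma}{\sqrt{n}} < -\mu < -\bar{Y} + 1.96\frac{\sigma}{\sqrt{n}}\right) \\
&= P\!\left(\bar{Y} + 1.96\frac{\sigma}{\sqrt{n}} > \mu > \bar{Y} - 1.96\frac{\sigma}{\sqrt{n}}\right);
\end{aligned}
$$

that is,

$$.95 = P\!\left(\bar{Y} - 1.96\frac{\sigma}{\sqrt{n}} < \mu < \bar{Y} + 1.96\frac{\sigma}{\sqrt{n}}\right). \qquad (5.5.1)$$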


This statement tells us that there is 95 percent probability that Ȳ −
1.96(σ/√n) and Ȳ + 1.96(σ/√n) will bracket the population mean μ. This means
that in the long run of repeatedly taking random samples of size n from the
population of Y, 95 percent of the time the sample will give a ȳ such that
ȳ − 1.96(σ/√n) and ȳ + 1.96(σ/√n) bracket the mean μ. Thus when we draw
just a single sample, we are willing to bet 95:5 that its ȳ is the kind that gives a
bracket of μ by ȳ ± 1.96(σ/√n). We express this by saying that we have 95
percent confidence in stating that μ lies between ȳ − 1.96(σ/√n) and ȳ +
1.96(σ/√n). We call the bracket interval a 95 percent confidence interval.
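The long-run reading of the 95 percent figure can be imitated by drawing many samples and counting how often the bracket catches μ. This is a sketch under assumed values: a normal population with μ = 100 and σ = 16, samples of size 64, and 10,000 repetitions are all hypothetical choices made for the illustration, as is the function name `coverage`.

```python
import random
from math import sqrt


def coverage(mu: float, sigma: float, n: int, repetitions: int,
             z_star: float = 1.96, seed: int = 1976) -> float:
    """Fraction of repeated samples whose bracket contains the true mean."""
    rng = random.Random(seed)
    half_width = z_star * sigma / sqrt(n)     # 1.96 standard errors
    hits = 0
    for _ in range(repetitions):
        y_bar = sum(rng.gauss(mu, sigma) for _ in range(n)) / n
        if y_bar - half_width < mu < y_bar + half_width:
            hits += 1
    return hits / repetitions


rate = coverage(mu=100.0, sigma=16.0, n=64, repetitions=10_000)
print(f"fraction of brackets containing mu: {rate:.3f}")
```

The fraction should come out near .95. Any single bracket either contains μ or does not; the 95 percent describes the procedure, not one particular interval.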


Example 5.5.1 


Suppose that we know that a certain random variable Y has a standard
deviation of 16: σ = 16. Suppose next that we take a random sample of 64
observations on Y, calculate the mean of the resulting values, and find that to
be 142.61: ȳ = 142.61. We can now proceed as follows.


The limits of a 95 percent confidence interval for μ are

$$142.61 \pm 1.96\left(\frac{16}{\sqrt{64}}\right) = 142.61 \pm 1.96\left(\frac{16}{8}\right) = 142.61 \pm 1.96(2) = 142.61 \pm 3.92.$$

Then by using the minus and the plus values, we get 142.61 − 3.92 = 138.69 as
the lower limit, and 142.61 + 3.92 = 146.53 as the upper limit, so that we can
state that a 95 percent confidence interval for the population mean μ is

$$138.69 < \mu < 146.53.$$

This tells us the limits (138.69, 146.53) between which we believe μ lies, and
the degree of confidence (95 percent) that we can have in the procedure that
gave us the limits.




We can of course arrange for any degree of confidence other than 95 percent 
by simply changing the z value from 1.96 to that value in Table A-3 which 
gives us the desired area under the standard normal curve. We can state the 
general confidence interval as follows. 


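A c percent confidence interval for μ is

$$\bar{y} - z^{*}\frac{\sigma}{\sqrt{n}} < \mu < \bar{y} + z^{*}\frac{\sigma}{\sqrt{n}}, \qquad (5.5.2)$$

where z* is the value from Table A-3 for which the area under the standard
normal curve between −z* and +z* is c percent.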


Example 5.5.2 


Suppose that in the situation of Example 5.5.1 we want to have a 99 percent
confidence interval for μ. With .99 area in the center of the z distribution,
there is area .005 left out in the left tail and area .005 left out in the right tail,
so that, according to Table A-3,

$$-z^{*} = z_{.005} = -2.58 \quad \text{and} \quad z^{*} = z_{.995} = +2.58.$$

Then the limits of a 99 percent confidence interval for μ are

$$142.61 \pm 2.58\left(\frac{16}{\sqrt{64}}\right) = 142.61 \pm 2.58(2) = 142.61 \pm 5.16,$$

and our 99 percent confidence interval for μ is 137.45 < μ < 147.77.




Notice that this interval is wider than the 95 percent confidence interval in 
Example 5.5.1. This matches common sense, since we should expect to be 
forced to enlarge the interval around ȳ if we want to increase our confidence in
the bracket. Conversely we can shorten the confidence interval by reducing our 
degree of confidence. 
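The trade between confidence and width can be tabulated for the data of Example 5.5.1 (ȳ = 142.61, σ = 16, n = 64). The sketch below is illustrative: the function name `confidence_interval` is hypothetical, and it takes z from Python's `statistics.NormalDist` rather than from Table A-3, so the 99 percent limits use 2.576 instead of the rounded 2.58 and differ slightly in the second decimal.

```python
from math import sqrt
from statistics import NormalDist


def confidence_interval(y_bar: float, sigma: float, n: int, confidence: float):
    """Limits y_bar -/+ z * sigma / sqrt(n) when sigma is known."""
    # z leaves (1 - confidence)/2 of the area in each tail of the normal curve.
    z_star = NormalDist().inv_cdf(0.5 + confidence / 2)
    half_width = z_star * sigma / sqrt(n)
    return y_bar - half_width, y_bar + half_width


intervals = {}
for level in (0.90, 0.95, 0.99):
    intervals[level] = confidence_interval(142.61, 16.0, 64, level)
    low, high = intervals[level]
    print(f"{level:.0%} confidence: {low:.2f} < mu < {high:.2f}  (width {high - low:.2f})")
```

Reading down the output, each increase in the confidence level widens the interval, and each decrease shortens it.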


The standard deviation of the sample mean Ȳ has been given the special
name standard error of the (sample) mean:

$$\text{Standard error of the (sample) mean} = \text{standard deviation of } \bar{Y} = \frac{\sigma}{\sqrt{n}}.$$

With this nomenclature, one may find it useful to look on the limits in (5.5.2)
as “ȳ plus-minus so many standard errors.” The “so many” is determined from
the tables of Z to give the degree of confidence we want.


5.6 REQUIRED SAMPLE SIZE 


One of the most common questions asked of a statistician is “How large a
sample should I take?” The answer depends right away on what the questioner
wants to do with the data. When he wants to estimate the population mean μ,
the logic for the answer goes as follows.


In (5.5.2) we see z*(σ/√n) as the allowance to place around ȳ in order to
give us a confidence interval for μ. We can then have c percent confidence that
μ lies within that distance from ȳ. If we specify how close we want the estimate
to be, we are specifying the value of z*(σ/√n), and we have the beginning of an
equation to solve for n. Let us designate the specified distance by d. Then our
requirement for sample size is

$$z^{*}\frac{\sigma}{\sqrt{n}} \le d. \qquad (5.6.1)$$

With d specified, we still must know z* and σ in order to solve for n. This is
why the statistician asks for answers to the following questions before he
ventures an opinion as to what sample size is needed.


a. How close to μ do you want to be? (The answer is d.)


b. How sure do you want to be that you are that close? (The answer gives c
percent, and from that will give z*.)


c. What can you tell me about the variability in the characteristic which you
are measuring? (The answer is needed for deciding on a value for σ.)


The last question is of course the most difficult to answer. Past experience
with similar studies may provide reliable information about σ. All else failing,
the statistician may ask “What are the smallest and largest values ever
observed for Y?” Here he has in mind the picture of the normal distribution,
where virtually all (99.74 percent) of observations fall within ±3 standard
deviations of the mean. That suggests a spread of 6σ between the smallest and
largest predictable observations. He could then in his desperation take σ as
one-sixth of the distance between the smallest and largest y-values ever
observed.


Example 5.6.1 


The Hoboken plant is producing 1-pound cans of Maxban Cottage Coffee.
The net weight of coffee on the packing line has been set by Manufacturing
Standards as μ = 16.04 ounces. Historically, the weekly net weight records
have shown that packing line number 6 has a standard deviation of σ = .03
ounce (i.e., the net weight from can to can varies with a σ = .03 ounce). For
management and quality-control reasons, it is desired to estimate the true
weekly average packed weight to within ±.01 ounce, with 95 percent confidence.
How many 1-pound cans of coffee will have to be randomly chosen each week in
order to guarantee these specifications?


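n = sample size = ?
σ = .03
z* = 1.96 (normal deviate from 5 percent tail area split 2.5 percent
in each tail)
d = .01

Therefore

$$
\begin{aligned}
z^* \frac{\sigma}{\sqrt{n}} &\le .01 \\
\frac{(1.96)(.03)}{\sqrt{n}} &\le .01 \\
\frac{(1.96)(.03)}{.01} &\le \sqrt{n} \\
5.88 &\le \sqrt{n} \\
(5.88)^2 &\le n \\
34.57 &\le n
\end{aligned}
$$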




Thus n = 35 sample cans will be taken at random throughout each week on
line 6, the net weight of coffee determined for each can, and the average of all
35 cans calculated. On the assumption of a normally distributed net coffee
weight for 1-pound cans, the true mean weekly net weight will be estimated
within ±.01 ounce with this sample of n = 35 cans.


Example 5.6.2 
Consider again the situation in Examples 5.5.1 and 5.5.2. There, with σ = 16
and n = 64, we can be 95 percent confident that μ is within ±3.92 of ȳ, and 99
percent confident that μ is within ±5.16 of ȳ. Suppose that we want to be 99
percent confident that μ is within ±3 of ȳ. Then we have

d = 3; z* = 2.58; σ = 16;

and (5.6.1) gives

$$\frac{(2.58)(16)}{\sqrt{n}} \le 3, \qquad \frac{(2.58)(16)}{3} \le \sqrt{n},$$

that is,

$$\sqrt{n} \ge 13.76, \qquad n \ge 189.3376.$$

Since the sample size must be a whole number, we see that the smallest sample
size that will accomplish what we want is n = 190.


It is worth noting the effect of each of the three conditions d, z*, and σ on
the sample size requirement. We can see this in general form if we solve the
inequality (5.6.1) to give a general formula for n:

$$z^* \sigma \le d\sqrt{n},$$

giving

$$n \ge \left[\frac{z^* \sigma}{d}\right]^2. \qquad (5.6.1')$$

Here we can see that the required sample size: (a) increases as the desired
degree of confidence increases (since increasing confidence demands increasing
z*), (b) increases as the standard deviation increases, and (c) decreases as the
allowable tolerance d increases. Moreover (5.6.1') shows that any of these
effects is in proportion to the square of the determining factor.
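
The general formula can be sketched in a few lines of Python. This is only an illustration: the function name `required_sample_size` and its argument names are assumptions made here, and z* must still be looked up in the tables of Z by the user. The two calls reuse the numbers of Examples 5.6.1 and 5.6.2.

```python
import math

def required_sample_size(d, z_star, sigma):
    """Smallest whole n with z* * sigma / sqrt(n) <= d,
    that is, n >= (z* * sigma / d) ** 2."""
    if d <= 0 or z_star <= 0 or sigma <= 0:
        raise ValueError("d, z_star, and sigma must all be positive")
    # Round up, since the sample size must be a whole number.
    return math.ceil((z_star * sigma / d) ** 2)

# Example 5.6.1: sigma = .03, z* = 1.96, d = .01
n_coffee = required_sample_size(d=0.01, z_star=1.96, sigma=0.03)

# Example 5.6.2: sigma = 16, z* = 2.58, d = 3
n_second = required_sample_size(d=3, z_star=2.58, sigma=16)

print(n_coffee, n_second)  # prints: 35 190
```

Halving d in either call roughly quadruples the result, which is the "square of the determining factor" effect described above.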




EXERCISES 


5.6.1 Measurement errors in laboratory tests are usually normally distributed. Suppose
a certain laboratory test has a σ = 1.4.
a. Find: (i) a 95 percent confidence interval for the true value of the determination
using a test that comes out 17.5, (ii) a 99 percent confidence interval, and
(iii) the lengths of these intervals.
b. Suppose now that the experiment is run five times and the mean is used to
find the confidence interval. What will the lengths be now?
c. What would the lengths be if 50 tests were done?
d. What is the fewest number of tests that you would have to run so that a 99
percent confidence interval would have length 2?


5.6.2 A dentist wants to obtain a 95 percent confidence interval estimate of the
average pain threshold of patients as measured by a dolorimeter. The measurement
is the amount of heat (in millicalories) to which the patient reacts. On the
basis of other research work he feels that the results will be normally distributed
with a standard deviation equal to 50 millicalories.
a. He tests 100 patients and the resulting mean is ȳ = 230 millicalories.
(i) Compute the 95 percent confidence interval for the true mean of the
population.
(ii) What is the interpretation of this confidence interval?
(iii) Do you think that a second sample of 100 patients would yield a
confidence interval of 190 to 210 millicalories? Why or why not?
b. How many patients should the dentist study in order to estimate the true pain
threshold of patients within 10 millicalories with 90 percent confidence?


5.6.3 Resort community A in the mountains has been claiming that its lake is the
purest (unpolluted) in the whole state. Their claim is that the true mean total
solids in milligrams/liter is 40 with a σ = 5. Resort community B took 100
samples from its lake and found mean total solids in milligrams/liter to be 39.
Assuming σ to be the same for lake B, place 95 percent confidence limits on the
true mean total solids in community B's lake. Do you think it should publicize
its finding and claim an even purer lake? Why or why not?


5.6.4 The blood-clotting time of hemophiliacs is normally distributed with a mean of 5
minutes and a standard deviation of 2 minutes. A new drug has been marketed
whose efficacy is based on reducing the blood-clotting time of hemophiliacs. A
random sample of nine hemophiliacs was chosen and given the new drug; the
average blood-clotting time was 4 minutes.
a. Assuming that the standard deviation has remained the same, place 90
percent confidence limits on the true mean blood-clotting time of hemophiliacs
using the new drug.
b. How many hemophiliacs would have to be tested (assuming σ = 2) so that a
90 percent confidence interval would have length 0.5 minute?
c. Would the 90 percent confidence interval in (b) guarantee that the new drug
would shorten clotting time?




5.6.5 In a certain normal population σ is known to be 25. If it is required that one be
95 percent confident that the ȳ of a sample from this population shall be within
4 of the true population mean μ, how large a sample must be taken?


5.6.6 Suppose it is known from long experience that the variability in a certain method
of determining the concentration of a chemical in solution is given by σ = .005
(grams per cubic centimeter). Determine the number of measurements required
to give a 99 percent confidence interval for concentration which is .001 grams
per cubic centimeter wide.


5.6.7 Suppose that you want to take a sample for estimating μ, of sufficient size to give
95 percent confidence that the resulting ȳ will be within 4 of μ. You decide that
18 is a "surely large enough" value to assume for σ. How large should the
sample be?


5.6.8 In people classified as normal in health, the mean serum haptoglobulin is known 
to be 100 milligrams per 100 milliliters with a standard deviation of 40 
milligrams per 100 milliliters. A random sample of 25 cancer patients were 
found to have a mean serum haptoglobulin of 114 milligrams per 100 milliliters. 
Using the known standard deviation, does a 95 percent confidence interval for 
mean serum haptoglobulin of cancer patients include the normal value of 100 
milligrams per 100 milliliters? 


5.7 A CONFIDENCE INTERVAL FOR μ
WHEN σ IS UNKNOWN


In most practical situations σ is really just as much unknown as μ. We have
to behave as in the preceding section if we want to calculate a required sample
size. But we should certainly prefer to base a confidence interval on the ȳ and s
of our data rather than on ȳ and an arbitrarily chosen value of σ. What we
want is a statement like (5.5.2), but with s replacing σ.

Let us track (5.5.2) back to its beginning. That beginning was the statement
(5.4.2):

$$
\frac{\bar{Y} - \mu}{\sigma/\sqrt{n}}
\begin{cases}
= Z & \text{if } Y \text{ is normal}, \\
\approx Z & \text{if } Y \text{ is nonnormal and } n \text{ is large}.
\end{cases}
$$

If we now plan to replace σ by s in our calculations, then there are two random
variables in the fraction on the left: Ȳ and s. And our argument about a
confidence interval for μ starts with the fraction

$$\frac{\bar{Y} - \mu}{s/\sqrt{n}}. \qquad (5.7.1)$$

This fraction does not have the probability distribution of Z.




When n is "large," the fraction behaves in a manner close to that of Z, and
for many years statisticians had to rely on that approximation. But results were
always dubious when small values of n were involved, and such cases became
the frequent ones where the time, expense, or other difficulty of making
observations forces the investigator to limit the number of observations. Also
there are many situations where an investigator can be satisfied with a fairly 
wide confidence interval, based on few observations for economic reasons, 
provided he can depend on the level of confidence. 


In the early years of this century, William S. Gosset, a mathematician
working for the Guinness Brewery in Dublin, worked out the exact
probability distribution of the fraction (5.7.1) in the case where the population 
of Y has a normal distribution. He published his results under the pseudonym 
"Student." (There is a widely held belief that he was forced to use a 
pseudonym because his employers looked on the mathematical results as a 
potential trade secret.) The distribution that he derived came to be known as 
Student's distribution. The lower-case letter t became standard notation for the 
random variable, and now the distribution is customarily referred to as Student's 
t distribution, or simply the t distribution. 


The t distribution is a continuous distribution having a probability density
function that graphs as a smooth curve above a t axis extending from −∞ to
+∞. The curve is symmetric about t = 0 and looks very much like the curve for
the standard normal random variable Z except for being more widely spread.
Its precise shape depends on the numerical value of n. Thus there is a different 
t distribution for every different value of n. 


After Gosset's breakthrough concerning the fraction (5.7.1), many other
applications of t distributions were discovered. Sample size is involved in all of
these, but in various ways for various applications. Hence something other than
n is preferable as a parameter to identify the different t distributions. The
choice that was made is a parameter called the number of degrees of freedom
(abbreviated d.f.). The concept is the same one which we discussed in Chapter
3 when taking n − 1 as the divisor in s². Indeed since s² is involved in the
fraction (5.7.1), it turns out that the fraction has the t distribution with n − 1 d.f.


The following diagram illustrates the general shape of t distributions and 
shows the reduction in spread as the number of degrees of freedom increases, 
with the standard normal distribution being the limit approached as the 
number of degrees of freedom goes to infinity. 




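[Figure: probability density curves of t with 6, 8, and 12 d.f., together with the
standard normal distribution (∞ d.f.), plotted against t and centered at 0.]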
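
The narrowing of the curves can also be checked numerically. The sketch below is not taken from the text: it assumes the standard density formula for Student's t (which this book does not state) and uses only Python's standard library. The function names `t_pdf` and `z_pdf` are chosen here for illustration.

```python
import math

def t_pdf(t, df):
    """Density of Student's t with df degrees of freedom.
    lgamma keeps the constant finite even for very large df."""
    log_const = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                 - 0.5 * math.log(df * math.pi))
    return math.exp(log_const - (df + 1) / 2 * math.log1p(t * t / df))

def z_pdf(z):
    """Density of the standard normal distribution."""
    return math.exp(-z * z / 2) / math.sqrt(2 * math.pi)

# Height at the center (t = 0) and in the tail (t = 3) for several d.f.
for df in (6, 8, 12, 1000):
    print(df, round(t_pdf(0, df), 4), round(t_pdf(3, df), 4))
print("normal", round(z_pdf(0), 4), round(z_pdf(3), 4))
```

As the number of degrees of freedom grows, the center of the curve rises and the tails thin out, both moving toward the standard normal values.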


Since there is a different t distribution for every different number of degrees 
of freedom, it is impractical to give tables for t in the same detail as we have 
used in Table A-3 for the single distribution of Z. What is done is to table a 
few of the most useful percentiles of the t distribution for a variety of values of 
the number of degrees of freedom. Table A-4 in the Appendix is such a table. 


The following diagram illustrates the use of Table A-4. 


[Figure: probability density function of t with 12 d.f., with the percentiles
t_.15 = −1.083 and t_.85 = 1.083 marked on the t axis.]

P(t < −1.083 | 12 d.f.) = .15, P(t < 1.083 | 12 d.f.) = .85,
P(−1.083 < t < 1.083 | 12 d.f.) = .70.


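Applying the probability theory developed by Gosset and others, we arrive
at the following operational rule, to replace (5.4.2) when σ is unknown:

$$
\frac{\bar{Y} - \mu}{s/\sqrt{n}}
\begin{cases}
= t \text{ with } n-1 \text{ d.f.} & \text{if } Y \text{ is normal}, \\
\approx t \text{ with } n-1 \text{ d.f.} & \text{if } Y \text{ is nonnormal and } n \text{ is large}.
\end{cases}
\qquad (5.7.2)
$$

From this rule we can proceed in exactly the same manner as in Section 5.5,
arriving at a confidence interval to replace (5.5.2):

$$\bar{y} - t^* \frac{s}{\sqrt{n}} < \mu < \bar{y} + t^* \frac{s}{\sqrt{n}}, \qquad (5.7.3)$$

where t* is the confidence-limit value of t with n − 1 d.f., determined by
P(−t* < t < t* | n − 1 d.f.) = c percent.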




Example 5.7.1 

Consider the random sample of nine student G.P.A.'s that we drew in
Section 5.3. There we found ȳ = 2.313, s² = 0.7117, and s = 0.844. Assuming
that the G.P.A. is normally distributed, we can apply (5.7.3) to obtain a
confidence interval for μ. Let us find a 95 percent confidence interval. Here t
has n − 1 = 9 − 1 = 8 d.f.

Table A-4 gives us the following probability diagram.

[Figure: probability density function of t with 8 d.f., with the central
95 percent lying between t = −2.306 and t = 2.306.]



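The limits of a 95 percent confidence interval for μ are

$$
\begin{aligned}
2.313 \pm 2.306\left(\frac{.844}{\sqrt{9}}\right) &= 2.313 \pm 2.306\left(\frac{.844}{3}\right) \\
&= 2.313 \pm 2.306(.281) \\
&= 2.313 \pm .648,
\end{aligned}
$$

so that a 95 percent confidence interval for μ is
1.665 < μ < 2.961.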


If this interval seems unduly wide for practical use, keep in mind that it is
based on a sample of only nine observations. If tighter estimation of μ is
required, a larger size sample will have to be taken.
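
The arithmetic of (5.7.3) can be sketched as a small Python function. The name `t_interval` and its arguments are assumptions made for this illustration; in particular, the value t* is not computed here and must be read from a t table for n − 1 d.f.

```python
import math

def t_interval(ybar, s, n, t_star):
    """Limits ybar -/+ t* * s / sqrt(n).
    t_star must be looked up in a t table for n - 1 d.f."""
    half_width = t_star * s / math.sqrt(n)
    return ybar - half_width, ybar + half_width

# Example 5.7.1: ybar = 2.313, s = 0.844, n = 9, t* = 2.306 (8 d.f., 95 percent)
low, high = t_interval(ybar=2.313, s=0.844, n=9, t_star=2.306)
print(round(low, 3), round(high, 3))
```

Because the hand calculation rounds at each intermediate step, the printed limits may differ from 1.665 and 2.961 by about one unit in the third decimal place.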


We mentioned earlier that the standard deviation of Ȳ, which is (σ/√n), is
called the standard error of the (sample) mean. In the same manner, (s/√n) is
called the estimated standard error of the (sample) mean. Thus the limits in
(5.7.3) can be remembered as "ȳ plus-minus so many estimated standard
errors." The "so many" is determined from the tables of t (n − 1 d.f.) to give
the degree of confidence we want.


Comparing the intervals (5.5.2) and (5.7.3), we see that they have exactly the
same general structure, with z* being used when we know σ, and t* being used
when we do not know σ and take s in its place. To keep the distinction straight,
it may be helpful to look on (σ, z) as a pair of running mates and on (s, t) as
another pair. Having this convenient arrangement of one general structure,
easily specialized to two different situations, is one of the rewards of accepting
the (n − 1) in the definition of s². If we had started with n, there would now be
unpleasant contortions to bring in the degrees of freedom.


One computational comment should be made. Calculating (σ/√n) or (s/√n) is
usually unpleasant since n is not often a perfect square. We should always keep
in mind the equalities

$$\frac{\sigma}{\sqrt{n}} = \sqrt{\frac{\sigma^2}{n}}, \qquad \frac{s}{\sqrt{n}} = \sqrt{\frac{s^2}{n}}.$$

In either of these equalities, the member on the right is often easier to compute
than the one on the left. This is almost always the case when s is involved,
since we get s² first and then have to extract a square root to get s. It is usually
easier to make the division (s²/n) and then take the square root than it is to
take two square roots, √s² and √n, and then make the division. We also
generally get a bonus by having less rounding error. Even in Example 5.7.1,
where n = 9, we do just as well with √(s²/n):

$$\sqrt{\frac{s^2}{n}} = \sqrt{\frac{.7117}{9}} = \sqrt{.0791} = .281.$$
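
The remark about rounding error can be illustrated with the numbers of Example 5.7.1. The short Python sketch below is only a check of the arithmetic, with variable names chosen here; it compares dividing first and taking one square root against taking s rounded to three decimals (as one would by hand) and then dividing.

```python
import math

s_squared, n = 0.7117, 9   # values from Example 5.7.1

one_root = math.sqrt(s_squared / n)                        # divide, then one root
two_roots = math.sqrt(s_squared) / math.sqrt(n)            # two roots, then divide
rounded_route = round(math.sqrt(s_squared), 3) / math.sqrt(n)  # s rounded to .844 first

print(round(one_root, 5), round(two_roots, 5), round(rounded_route, 5))
```

At full precision the first two routes agree; the third drifts slightly because s was rounded before the division, which is the rounding penalty mentioned above.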


Example 5.7.2 

A random sample consisting of 10 rats was placed on a fat-free diet. Their
gains in weight were recorded after two weeks, and the mean gain was ȳ = 60
grams, with a calculated standard deviation s = 10 grams. Place 95 percent
confidence limits on μ, the true mean gain in weight for the population.


Here t has 9 degrees of freedom, that is, the number of degrees of freedom
associated with s². For a 95 percent confidence interval, using the t table
(Table A-4 in the Appendix) as we did before, we find t* = 2.262, and then we
have the 95 percent confidence interval as follows:

$$
\begin{aligned}
\bar{y} - t^* \frac{s}{\sqrt{n}} &< \mu < \bar{y} + t^* \frac{s}{\sqrt{n}} \\
60 - 2.262\left(\frac{10}{\sqrt{10}}\right) &< \mu < 60 + 2.262\left(\frac{10}{\sqrt{10}}\right) \\
60 - 2.262\sqrt{\frac{100}{10}} &< \mu < 60 + 2.262\sqrt{\frac{100}{10}} \\
60 - 2.262\sqrt{10} &< \mu < 60 + 2.262\sqrt{10} \\
60 - 2.262(3.16) &< \mu < 60 + 2.262(3.16) \\
60 - 7.15 &< \mu < 60 + 7.15 \\
52.85 &< \mu < 67.15
\end{aligned}
$$

Therefore the true mean μ of the weight gain on a fat-free diet lies in the
interval (52.85 < μ < 67.15 grams). I make this statement with 95 percent
confidence.


Example 5.7.3 

A random sample of 20 observations from a normal population yields
ȳ = 84.7 and s² = 24.68. What is a 99 percent confidence interval for the
population mean μ?

Here we use t with 19 d.f. Taking probability .99 in the center of the t
distribution leaves .005 in the tail on the left and .005 in the tail on the right.
Hence the t value on the left is t_.005 (the 0.5th percentile) and the t value on the
right is t_.995 (the 99.5th percentile). Table A-4 shows these values to be −2.861
and +2.861.

The limits of a 99 percent confidence interval for μ are

$$
\begin{aligned}
84.7 \pm 2.861\sqrt{\frac{24.68}{20}} &= 84.7 \pm 2.861\sqrt{1.23} \\
&= 84.7 \pm 2.861(1.11) \\
&= 84.7 \pm 3.2,
\end{aligned}
$$

giving the 99 percent confidence interval
81.5 < μ < 87.9.


Insisting on calculating (s/√n) would lead to the following more tedious
computation:

$$\frac{s}{\sqrt{n}} = \frac{\sqrt{24.68}}{\sqrt{20}} = \frac{4.97}{4.47} = 1.11.$$


In Section 5.6 we worked out a procedure for choosing a sample size n that
would meet certain specifications for closeness of estimating μ. That procedure
requires the use of a known or assumed value of σ. You might now think that
we could use (5.7.3) to give us a way of choosing n without assuming a value
for σ. But this is impossible. One half of the interval in (5.7.3) is t*(s/√n). If we
specify that this should not exceed the amount d, we state

$$t^* \frac{s}{\sqrt{n}} \le d.$$

But t* cannot be known without knowing the number of degrees of freedom,
and that number (n − 1) requires the value of n, for which we are now trying to
solve. Moreover s cannot be known until we draw the sample and make the
calculations. Thus there is no way to solve the above inequality for n. Hence
the best we can do ahead of time is to choose n by the method of Section 5.6.
After we draw the sample we should proceed by the method of the present
section, getting a confidence interval for μ in accordance with (5.7.3), using t*
and s.




EXERCISES 


5.7.1 A sample of 16 from a certain normal population gave the results ȳ = 74.92 and
s = 12.00. Give a 95 percent confidence interval for the population mean μ.


5.7.2 A random sample of 10 observations on the temperature of a kiln gave the
following results (degrees centigrade): 45, 55, 68, 55, 51, 44, 42, 45, 53, and 37.
Determine a 95 percent confidence interval for the true mean kiln temperature.


5.7.3 A sample of 16 from a certain normal population yields ȳ = 124 and s = 20.
a. Give a 99% confidence interval for the population mean μ.
b. What is the precise meaning of "99% confidence interval" in this situation?


5.7.4 A sample of n = 10 observations is taken from a large number of accounts
receivable. The calculated mean accounts receivable balance of these 10 accounts
is $60 with a calculated standard deviation s = $10. Place 95 percent
confidence limits on the true mean accounts-receivable balance.


5.7.5 A random sample of 20 7-ounce "Sticky" shampoo bottles was selected. The
net content in each bottle was determined. These are recorded as follows:


69 8&0 72' 72 TARTAS 7:,0,86.02 1:0 
70 78 69 LE LS EO 8740 7.5 687.2 


Place 90 percent confidence limits on the true mean net content of a 7-ounce bottle 
of "Sticky" shampoo. 


5.7.6 A large department store had 100,000 accounts receivable at the end of the
fiscal year. A random sample of 1700 accounts was taken and the following were
calculated:

ȳ = 64 days (mean age of the accounts)

s = 25.6 days (standard deviation of the age of the accounts)


Calculate 95 percent confidence limits on the true mean age of accounts 
receivable. 




5.8 A CONFIDENCE INTERVAL FOR THE DIFFERENCE BETWEEN
TWO MEANS, μ₁ − μ₂, WHEN σ₁ AND σ₂ ARE KNOWN


There are many practical situations where the important point at issue is the 
difference between the means of two populations. What is the difference in 
mean life between two brands of automobile tires? What is the difference in 
mean reduction of blood pressure accomplished by two different drugs? What 
is the difference in mean performance of students taught by two different 
methods? What is the difference in mean lung capacity between smokers and 
nonsmokers? 


In each such case there are two populations involved, one associated with 
one circumstance, one associated with the other circumstance: tire population 1 
for brand 1, tire population 2 for brand 2; population 1 treated with drug 1, 
population 2 treated with drug 2; student population 1 taught by method 1, 
student population 2 taught by method 2; population 1 composed of smokers, 
population 2 composed of nonsmokers. In each case, population 1 has mean
and standard deviation μ₁ and σ₁, respectively, while population 2 has mean
and standard deviation μ₂ and σ₂, respectively.


Right away there is the serious nonstatistical question as to whether the
difference μ₁ − μ₂ makes any practical sense. If tire population 1 is subjected to
high-speed daytime driving in the desert while tire population 2 is assigned
low-speed nighttime driving in the mountains, the difference μ₁ − μ₂ in mean life
involves a lot more than just a difference in brands. If the population of
smokers is composed of old men and the population of nonsmokers is composed
of 20-year-old athletes, the difference μ₁ − μ₂ in mean lung capacity
reflects much more than just smoking versus nonsmoking. Statistical inference
is no substitute for logical scientific judgment. It is a powerful tool for
facilitating decisions in situations where all of the customary rules of sound
scientific procedure have been followed.


In scientific experiments, the characteristic observed and measured—the Y 
in our discussions—is often called the response of the population. In the above 
examples the response is tire life or reduction in blood pressure or student 
performance or lung capacity. The characteristic by which we identify two or 
more populations separately—the tire brand, the prescribed drug, the teaching 
method, the smoking status—is called a factor. The different specifications for a 
factor are called the levels of the factor. Thus we can say that we are going to 
study the response blood-pressure reduction under two levels of the factor drug, 
the factor levels being drug 1 and drug 2. 


The fundamental rule for a controlled experiment is that, except for the 
response that we shall observe, two populations should differ only with respect 
to the two levels of the factor at issue. This is the notion of having all other 
factors “controlled.” That is why the driving tests for tires will be prescribed 




identically for brands 1 and 2; the tires whose lives we will observe should 
otherwise differ only as to brand. Students taught by method 1 should differ 
from those taught by method 2 only with respect to method; any other factors 
that could influence student performance should be operating at the same level 
in both populations. 


Such complete control in experimentation is not always possible. Even when it 
is possible, careful planning of the experiment may enable us to study the 
effects of more than a single factor at a time. This is a large scientific subject in
itself, and we have no intention of doing anything more here than mention its 
existence. 


This lengthy discussion has been meant to alert the reader to the need for 
good scientific sense before blindly applying statistical techniques to data. In 
what follows we assume that the two populations under discussion have been 
defined in this way, and the observations have been made under such conditions
that the difference in means μ₁ − μ₂ does make good sense. Our purpose
then is to obtain a confidence interval for that difference.


If from population Y₁ (having mean μ₁ and standard deviation σ₁) we
construct a random sample of size n₁, then, as we have seen earlier, the sample
mean Ȳ₁ has a probability distribution in which the mean is μ₁ and the
standard deviation is (σ₁/√n₁). Similarly if from population Y₂ (having mean μ₂
and standard deviation σ₂) we construct a random sample of size n₂, the sample
mean Ȳ₂ has a probability distribution in which the mean is μ₂ and the
standard deviation is (σ₂/√n₂). What about the difference between the two
sample means, Ȳ₁ − Ȳ₂?

Since Ȳ₁ and Ȳ₂ are random variables, their difference Ȳ₁ − Ȳ₂ is a random
variable. As such it has a certain probability distribution, with a certain mean
and a certain standard deviation. Probability theory has established the following
important facts about this distribution.

1. The mean of Ȳ₁ − Ȳ₂ is μ₁ − μ₂:

$$\mu_{\bar{Y}_1 - \bar{Y}_2} = E(\bar{Y}_1 - \bar{Y}_2) = \mu_1 - \mu_2. \qquad (5.8.1)$$

2. If the two samples are drawn independently, the standard deviation of
Ȳ₁ − Ȳ₂ is the square root of the sum of the variances of Ȳ₁ and Ȳ₂:

$$\sigma_{\bar{Y}_1 - \bar{Y}_2} = \sqrt{E\left[(\bar{Y}_1 - \bar{Y}_2) - (\mu_1 - \mu_2)\right]^2} = \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}} \qquad (5.8.2)$$

if Ȳ₁ and Ȳ₂ are independent.

As in the case of a single-sample mean, the standard deviation of Ȳ₁ − Ȳ₂ is
customarily referred to as the standard error of the difference of (sample) means.




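3. If Y₁ and Y₂ both have normal distributions, then the distribution of
Ȳ₁ − Ȳ₂ is normal, for any values of n₁ and n₂; otherwise, the distribution
of Ȳ₁ − Ȳ₂ approaches the normal distribution as n₁ and n₂ both go to
infinity.

We can now go back to (5.4.2), rephrase it in terms of Ȳ₁ − Ȳ₂, and then
carry the discussion forward to a confidence interval in the form of (5.5.2).
The rephrased statement is

$$
\frac{(\bar{Y}_1 - \bar{Y}_2) - (\mu_1 - \mu_2)}{\sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}}}
\begin{cases}
= Z & \text{if } Y_1 \text{ and } Y_2 \text{ are normal}, \\
\approx Z & \text{if } n_1 \text{ and } n_2 \text{ are large},
\end{cases}
\qquad (5.8.3)
$$

and the resulting c percent confidence interval for μ₁ − μ₂ is

$$(\bar{y}_1 - \bar{y}_2) - z^* \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}} < \mu_1 - \mu_2 < (\bar{y}_1 - \bar{y}_2) + z^* \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}. \qquad (5.8.4)$$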


Example 5.8.1 


Two normal populations have standard deviations 10 and 15, respectively.
Independent random samples were drawn from the populations, 20 observations
from the first and 25 from the second. The resulting sample means were
84.7 and 72.1, respectively. What is a 95 percent confidence interval for the
difference between the population means?


The standard error of Ȳ₁ − Ȳ₂ is

$$\sqrt{\frac{(10)^2}{20} + \frac{(15)^2}{25}} = \sqrt{\frac{100}{20} + \frac{225}{25}} = \sqrt{5 + 9} = \sqrt{14} = 3.74.$$

The central 95 percent of the distribution of Z is contained between z = −1.96
and z = 1.96. Thus z* = 1.96. Hence by (5.8.4), the limits of a 95 percent
confidence interval for μ₁ − μ₂ are

(84.7 − 72.1) ± (1.96)(3.74) = 12.6 ± 7.3,




giving the 95 percent confidence interval as
5.3 < μ₁ − μ₂ < 19.9.


Notice that throughout this interval μ₁ is larger than μ₂, since the difference
μ₁ − μ₂ is always positive, going from +5.3 to +19.9. The interval thus allows
us to conclude that we are 95 percent confident that μ₁ exceeds μ₂ by an
amount somewhere between 5.3 and 19.9.
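
The calculation in Example 5.8.1 can be sketched in Python as follows. The function name `diff_means_z_interval` and its argument names are assumptions made for this illustration, and z* is taken from the tables of Z rather than computed.

```python
import math

def diff_means_z_interval(ybar1, ybar2, sigma1, sigma2, n1, n2, z_star):
    """Limits (ybar1 - ybar2) -/+ z* * sqrt(sigma1^2/n1 + sigma2^2/n2),
    for independent samples with known standard deviations."""
    standard_error = math.sqrt(sigma1 ** 2 / n1 + sigma2 ** 2 / n2)
    diff = ybar1 - ybar2
    return diff - z_star * standard_error, diff + z_star * standard_error

# Example 5.8.1: sigmas 10 and 15, sample sizes 20 and 25, means 84.7 and 72.1
low, high = diff_means_z_interval(84.7, 72.1, 10, 15, 20, 25, z_star=1.96)
print(round(low, 1), round(high, 1))  # prints: 5.3 19.9
```

Swapping the roles of the two samples simply reverses the signs of the limits, which is the maneuver used when an interval comes out negative throughout.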

There can occur intervals with one or two negative boundaries. In such cases
we have to draw the appropriate conclusions about the relative sizes of μ₁ and
μ₂, as in the following examples.


Example 5.8.2 
σ₁ = 8, n₁ = 16, ȳ₁ = 40.2;
σ₂ = 6, n₂ = 9, ȳ₂ = 47.4.

$$\sqrt{\frac{64}{16} + \frac{36}{9}} = \sqrt{4 + 4} = \sqrt{8} = 2.83.$$

The limits of a 95 percent confidence interval for μ₁ − μ₂ are
(40.2 − 47.4) ± (1.96)(2.83) = −7.2 ± 5.5,
so that the 95 percent confidence interval is
−12.7 < μ₁ − μ₂ < −1.7.


Here we see that μ₁ − μ₂ is negative throughout the interval, showing that μ₂
is always larger than μ₁ in the interval. The excess of μ₂ over μ₁ is somewhere
between 1.7 and 12.7. It is clearer to express the difference in this direction, a
maneuver easily performed by multiplying each member of the inequality by
−1:

12.7 > μ₂ − μ₁ > 1.7,
that is,
1.7 < μ₂ − μ₁ < 12.7.


Example 5.8.3

σ₁ = 16, n₁ = 64, ȳ₁ = 120.0;
σ₂ = 20, n₂ = 80, ȳ₂ = 116.0.

$$\sqrt{\frac{256}{64} + \frac{400}{80}} = \sqrt{4 + 5} = \sqrt{9} = 3.$$

The limits of a 95 percent confidence interval for μ₁ − μ₂ are
(120.0 − 116.0) ± (1.96)(3) = 4.0 ± 5.9,
giving the 95 percent confidence interval
−1.9 < μ₁ − μ₂ < 9.9.


In this interval the difference between the means varies from where μ₂ is 1.9
larger than μ₁ to where μ₁ is 9.9 larger than μ₂. The interval includes the case
μ₁ − μ₂ = 0, where μ₁ and μ₂ are equal. We have to end up with the conclusion
that we can be 95 percent confident that the relation between the two
population means is somewhere between the case where μ₂ is 1.9 larger than
μ₁ and the case where μ₁ is 9.9 larger than μ₂. This leaves the case of no
difference (μ₁ − μ₂ = 0) within our confidence boundaries.


Example 5.8.4 


Let us suppose that there are two different packing lines, number 6 and 
number 7, producing 1-pound cans of Ohwell Tea. While σ₆ = .03 ounce is
correct for line 6, line 7 is older and has σ₇ = .04 ounce. During a given day, 16
random cans were taken from each line and the mean packed weights were
recorded. The mean for line 6 was 16 ounces, and the mean for line 7 was
16.04 ounces. Place 99 percent limits on the difference between the true line
means. The solution is as follows:


Line 7                 Line 6
ȳ₇ = 16.04 ounces      ȳ₆ = 16 ounces
n₇ = 16 cans           n₆ = 16 cans
σ₇ = 0.04 ounce        σ₆ = 0.03 ounce


Using the procedures previously followed, we find the 99 percent confidence
limits for μ₇ − μ₆ to be

$$
\begin{aligned}
(16.04 - 16.00) \pm 2.576\sqrt{\frac{.0016}{16} + \frac{.0009}{16}} &= 0.04 \pm 2.576\sqrt{.0001 + .00005625} \\
&= 0.04 \pm 2.576\sqrt{.00015625} \\
&= 0.04 \pm (2.576)(.0125) \\
&= 0.04 \pm .0322,
\end{aligned}
$$

giving
0.0078 < μ₇ − μ₆ < 0.0722.


At the 1 percent risk of being wrong, we conclude that there is a real
difference between the performance of the two lines, on the average line 7
packing anywhere between .0078 and .0722 ounce more tea per can than line
6.




5.9 A CONFIDENCE INTERVAL FOR THE DIFFERENCE BETWEEN
TWO MEANS, μ₁ − μ₂, WHEN σ₁ AND σ₂ ARE UNKNOWN


As in the situation involving just one population, we generally do not know
σ₁ and σ₂. Again large values of n₁ and n₂ can encourage us to replace σ₁ and
σ₂ by s₁ and s₂, respectively, and take (5.8.4) as an approximation to the
confidence interval for μ₁ − μ₂. But again as in the one-sample case, there are
times when important studies have to use small samples. Is there now a
procedure employing the t distribution in a manner similar to that of Section
5.7? The answer is yes, but there is a new difficulty.


The theory that gives us a t distribution concerning Ȳ₁ − Ȳ₂ not only requires
that Y₁ and Y₂ be normally distributed (as in the single-sample case) but also
requires that the two populations have the same standard deviation: σ₁ = σ₂.
Such equality of standard deviations is a severe restriction, and of course it is
not satisfied in general. However, the case where the difference in means
μ₁ − μ₂ makes the most practical sense is the case where the standard deviations
are at least approximately equal. Consider the following diagrams of two
pairs of distributions.


[Figure: two pairs of distributions plotted against y, with means μ₁ and μ₂
marked. Left: equal dispersions, different means. Right: different means and
different dispersions.]


The diagram on the left shows two distributions having equal dispersions but
different means. The difference in means gives a very good measure of the
separation of the two chance processes. In the diagram on the right are shown
two distributions differing both as to mean and as to dispersion. Here the
difference in means is a very incomplete measure of comparison. Notice, for
example, that extremely small y values have greater probability in population 2
than in population 1, even though the mean of population 2 is larger than that
of population 1.


Because we are concentrating in this book on the most clean-cut cases of
statistical inference, we shall deal only with the σ₁ = σ₂ situation when comparing
two means μ₁ and μ₂ in the absence of precise values for σ₁ and σ₂. If you
ask how we can assume that σ₁ and σ₂ are equal when we cannot assume a
specific value for either one of them, we must answer that there do exist
methods to check the assumption, methods that you would meet in more
advanced studies. Such studies would also tell you what to do when the
assumption σ₁ = σ₂ cannot be tolerated. Fortunately the standard method that
we shall present gives results close to exact when σ₁ and σ₂ differ by a small
amount and the sample sizes are equal or nearly so.




Operating under the assumption that $\sigma_1^2 = \sigma_2^2$, we would reasonably want to
average $s_1^2$ and $s_2^2$ in some way, to produce a single estimate of what is assumed
to be the single value of $\sigma_1^2$ and $\sigma_2^2$ (say $\sigma^2$). If the sample sizes $n_1$ and $n_2$ are
different, we would want to give greater weight to the $s^2$ that comes from the
larger sample. Since we look on the number of degrees of freedom as a
measure of the amount of information contained in a sample variance, we shall
use degrees of freedom as the weights in our average. We thus produce what is
called a pooled estimate of variance, designated $s_p^2$:

$$ s_p^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{(n_1 - 1) + (n_2 - 1)} = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}. \tag{5.9.1} $$


As we would deduce from pooling the information in the two samples, $s_p^2$ has
$(n_1 - 1) + (n_2 - 1)$ degrees of freedom, that is, $n_1 + n_2 - 2$ d.f.


Recall the formula for sample variance and clear the fraction: 


$$ s^2 = \frac{\sum (y - \bar{y})^2}{n - 1}, \qquad (n - 1)s^2 = \sum (y - \bar{y})^2. $$


Using this equality in (5.9.1) gives an alternative formula which is often more 
convenient for calculating: 


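$$ s_p^2 = \frac{\sum (y_1 - \bar{y}_1)^2 + \sum (y_2 - \bar{y}_2)^2}{(n_1 - 1) + (n_2 - 1)} = \frac{\sum (y_1 - \bar{y}_1)^2 + \sum (y_2 - \bar{y}_2)^2}{n_1 + n_2 - 2}. \tag{5.9.1'} $$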

This form is easy to remember since it tells us to pool the two sums of squares 

and pool the degrees of freedom, then make the division. 
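
As a rough illustration of (5.9.1), the short Python sketch below pools two sample variances using degrees of freedom as the weights. The function name `pooled_variance` and its argument names are our own labels, not notation from the text, and the numbers in the usage lines are chosen only for illustration.

```python
def pooled_variance(n1, s1_sq, n2, s2_sq):
    """Pooled estimate of variance: each sample variance is weighted
    by its degrees of freedom (n - 1), as in formula (5.9.1)."""
    df1, df2 = n1 - 1, n2 - 1
    return (df1 * s1_sq + df2 * s2_sq) / (df1 + df2)


# Illustrative values: n1 = 10 with s1^2 = 16.00, n2 = 12 with s2^2 = 12.25.
sp_sq = pooled_variance(10, 16.00, 12, 12.25)
print(round(sp_sq, 2))  # 13.94
```

When the two sample sizes are equal, the weights are equal and the pooled value is simply the ordinary average of the two sample variances.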


If we now go back to (5.8.3) and use $s_p^2$ in place of both $\sigma_1^2$ and $\sigma_2^2$, we shall
have a fraction proved by mathematicians to obey the t distribution with
$n_1 + n_2 - 2$ d.f.:

$$ \frac{(\bar{Y}_1 - \bar{Y}_2) - (\mu_1 - \mu_2)}{\sqrt{\dfrac{s_p^2}{n_1} + \dfrac{s_p^2}{n_2}}} = t \ \text{ with } n_1 + n_2 - 2 \text{ d.f.}, \tag{5.9.2} $$

provided $Y_1$ and $Y_2$ are normal and $\sigma_1 = \sigma_2$. The equality is a reasonable
approximation if violation of the proviso is not excessive and the sample sizes
are equal or nearly so.


From (5.9.2) follows a confidence interval to replace (5.8.4) when $\sigma_1^2$ and $\sigma_2^2$
are unknown.




Given independent random samples of sizes $n_1$ and $n_2$ respectively from
the populations of $Y_1$ and $Y_2$, a c percent confidence interval for the
difference $\mu_1 - \mu_2$ between the population means is

$$ (\bar{y}_1 - \bar{y}_2) - t_* \sqrt{\frac{s_p^2}{n_1} + \frac{s_p^2}{n_2}} < \mu_1 - \mu_2 < (\bar{y}_1 - \bar{y}_2) + t_* \sqrt{\frac{s_p^2}{n_1} + \frac{s_p^2}{n_2}}, \tag{5.9.3} $$

where $n_1, n_2$ = the two sample sizes;
$\bar{y}_1, \bar{y}_2$ = the two sample means;
$s_p^2$ = the pooled estimate of variance;
$t_*$ = the confidence-limit value of t with $n_1 + n_2 - 2$ d.f.,
determined by $P(-t_* < t < t_* \mid n_1 + n_2 - 2 \text{ d.f.}) = c$ percent.


The interval is exact or approximate according to the conditions stated with 
respect to (5.9.2). 


The above estimate of standard deviation, $\sqrt{(s_p^2/n_1) + (s_p^2/n_2)}$, is
commonly called the estimated standard error of the difference of (sample) means.


Example 5.9.1 


A research investigator reports that he has analyzed independent random 
samples from two normal populations which have approximately equal vari- 
ances. His report gives the following information about his data. 


$$ n_1 = 10, \quad \bar{y}_1 = 32.7, \quad s_1 = 4.0; $$
$$ n_2 = 12, \quad \bar{y}_2 = 24.3, \quad s_2 = 3.5. $$

The pooled estimate of variance, according to (5.9.1), is

$$ s_p^2 = \frac{9(4.0)^2 + 11(3.5)^2}{9 + 11} = \frac{9(16.00) + 11(12.25)}{20} = \frac{144.00 + 134.75}{20} = \frac{278.75}{20} = 13.94. $$




The t distribution here has 10 + 12 - 2 = 20 (or 9 + 11 = 20) d.f. A 95 percent
confidence interval for $\mu_1 - \mu_2$ requires $t_* = 2.086$ ($t_{.025}$ for 20 d.f. in Table
A-4). Applying (5.9.3), we can state that the limits of a 95 percent confidence
interval for $\mu_1 - \mu_2$ are

$$ (32.7 - 24.3) \pm 2.086 \sqrt{\frac{13.94}{10} + \frac{13.94}{12}} = 8.4 \pm 2.086\sqrt{1.39 + 1.16} $$
$$ = 8.4 \pm 2.086\sqrt{2.55} $$
$$ = 8.4 \pm 2.086(1.60) $$
$$ = 8.4 \pm 3.3, $$

so that a 95 percent confidence interval for $\mu_1 - \mu_2$ is

$$ 5.1 < \mu_1 - \mu_2 < 11.7. $$
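
The arithmetic of Example 5.9.1 can be retraced with a few lines of Python. This is only a sketch: the function name `pooled_t_interval` is ours, and the t value is not computed but passed in as `t_star`, exactly as one would read it from Table A-4.

```python
import math


def pooled_t_interval(n1, mean1, s1, n2, mean2, s2, t_star):
    """Confidence interval (5.9.3) for mu1 - mu2 from summary statistics.

    t_star is the tabled t value for n1 + n2 - 2 d.f.; the caller supplies it.
    """
    sp_sq = ((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / (n1 + n2 - 2)
    std_err = math.sqrt(sp_sq / n1 + sp_sq / n2)  # estimated standard error
    diff = mean1 - mean2
    return diff - t_star * std_err, diff + t_star * std_err


# Figures reported in Example 5.9.1, with t = 2.086 for 20 d.f.
low, high = pooled_t_interval(10, 32.7, 4.0, 12, 24.3, 3.5, t_star=2.086)
print(f"{low:.1f} < mu1 - mu2 < {high:.1f}")  # 5.1 < mu1 - mu2 < 11.7
```

Carrying full precision through the intermediate steps, rather than rounding at each stage as in the hand calculation, changes the limits only in the second decimal place.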


When we ourselves handle the sample data from the beginning, we of course
use whatever intermediate calculations will simplify the computation of $s_p^2$. Let
us take two very small samples with simple data and carry through the entire
procedure for obtaining a 90 percent confidence interval for the difference
between $\mu_1$ and $\mu_2$.


Example 5.9.2 


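Sample 1:                 Sample 2:
   y_1    y_1^2              y_2    y_2^2
     3       9                 4      16
     1       1                 7      49
     5      25                 9      81
     7      49                10     100
     4      16                11     121
    --     ---                 4      16
    20     100                --     ---
                              45     383

$$ \bar{y}_1 = \frac{20}{5} = 4.0, \qquad \bar{y}_2 = \frac{45}{6} = 7.5. $$

$$ s_1^2 = \frac{100 - \dfrac{(20)^2}{5}}{4} = \frac{100 - \dfrac{400}{5}}{4} = \frac{100 - 80}{4} = \frac{20}{4} = 5.00, $$

$$ s_2^2 = \frac{383 - \dfrac{(45)^2}{6}}{5} = \frac{383 - \dfrac{2025}{6}}{5} = \frac{383 - 337.5}{5} = \frac{45.5}{5} = 9.10. $$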




From (5.9.1') we have

$$ s_p^2 = \frac{20 + 45.5}{4 + 5} = \frac{65.5}{9} = 7.28. $$

The estimated standard error of $\bar{Y}_1 - \bar{Y}_2$ is

$$ \sqrt{\frac{7.28}{5} + \frac{7.28}{6}} = \sqrt{1.46 + 1.21} = \sqrt{2.67} = 1.63. $$


Here t has 5 + 6 - 2 = 9 (or 4 + 5 = 9) d.f. For a 90 percent confidence interval $t_*$
is $t_{.05}$, and Table A-4 gives that to be 1.833. Seeing that $\bar{y}_2$ is larger than $\bar{y}_1$, we
decide to take the difference in population means as $\mu_2 - \mu_1$, and then have the
limits of a 90 percent confidence interval for $\mu_2 - \mu_1$ as

$$ (7.5 - 4.0) \pm 1.833(1.63) = 3.5 \pm 3.0, $$

from which it follows that a 90 percent confidence interval for $\mu_2 - \mu_1$ is

$$ 0.5 < \mu_2 - \mu_1 < 6.5. $$
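
For readers who would like to check Example 5.9.2 by machine, here is a minimal Python sketch that starts from the raw observations. The helper names are our own. The fifth value of sample 1 is taken as 4, the value implied by the column totals 20 and 100. The t value 1.833 is again supplied from Table A-4 rather than computed.

```python
import math


def sum_of_squares(ys):
    """Sum of squared deviations from the mean, using the shortcut
    sum(y^2) - (sum y)^2 / n employed in the example."""
    return sum(y * y for y in ys) - sum(ys) ** 2 / len(ys)


def pooled_interval(sample_1, sample_2, t_star):
    """Interval for mu2 - mu1 based on the pooled variance (5.9.1')."""
    n1, n2 = len(sample_1), len(sample_2)
    sp_sq = (sum_of_squares(sample_1) + sum_of_squares(sample_2)) / (n1 + n2 - 2)
    std_err = math.sqrt(sp_sq / n1 + sp_sq / n2)
    diff = sum(sample_2) / n2 - sum(sample_1) / n1
    return diff - t_star * std_err, diff + t_star * std_err


sample_1 = [3, 1, 5, 7, 4]        # fifth value assumed from the totals
sample_2 = [4, 7, 9, 10, 11, 4]
low, high = pooled_interval(sample_1, sample_2, t_star=1.833)
print(f"{low:.1f} < mu2 - mu1 < {high:.1f}")  # 0.5 < mu2 - mu1 < 6.5
```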


EXERCISES 


5.9.1 Two different automated mechanistic processes were proposed for packaging 
“Proud” dog food. Let's call them process number 1 and process number 2. Both 
methods had been extensively tested on a similar product by the manufacturers. 
Process 1 had $\sigma_1$ = 2 ounces and process 2, $\sigma_2$ = 1.8 ounces. The two processes
were installed side-by-side and random samples of “Proud” raw materials were 
fed into both systems simultaneously. The following packaged weights (in 
ounces) were obtained from the two processes: 




Process 1 Process 2 
58 58 57 57 
57 56 58 58 
56 57 56 59 
58 62 56 58 
57 59 56 


a. Determine a 95 percent confidence limit on the difference between the true
mean weights produced by the two processes.
b. Is there enough evidence for you to say that the two processes are not doing
the same job?




5.9.2 Two vendors have been asked to furnish random samples of “Supercleanser”
cans made with a cardboard which will minimize the moisture-vapor transmis-
sion rate. The following figures are in grams per 100 square inches per 24 hours.


Supplier A Supplier B
.047 .049 .054 .050
.047 .047 .052 .051
.055 .051 .052 .054
.053 .046


Code the data by multiplying by 1000 and subtracting 40, thus getting 7, 7, 15,
13, and so on.

a. Calculate a 90 percent confidence interval for the true difference between the
mean transmission rates.

b. Is there any difference in the cans supplied by the two suppliers?


5.9.3 Twenty new accounting men are to be trained in our accounting system. Two
methods of training have been suggested. In order to test these methods, the 
trainees are divided into two groups. Group 1 is trained by method A and group 
2, by method B. After 3 months of training, a test set of material is given to all 
trainees. The scores of the 20 trainees are shown below. 


Training Method A Training Method B 
Y_1 Y_2
96 92 96 94 
90 94 84 93 
93 83 98 86 
88 80 95 89 
86 98 91 94 


Place a 95 percent confidence interval on the difference between the two 
training-method means. 


5.9.4 Two suppliers are trying to furnish a manufacturer with alkyl-benzene. Each of
the suppliers claims that his alkyl-benzene will cause a chemical reaction to yield
the highest percentage of completeness of reaction. The percentage of complete-
ness standard deviation for both materials is assumed to be $\sigma$ = 1 percent.

The manufacturer did $n_A$ = 10 reactions with alkyl-benzene from supplier A
and $n_B$ = 10 reactions with alkyl-benzene from supplier B. The following data
were obtained:


Supplier A Supplier B
96.2 95.8 96.3 97.2
95.8 96.4 97.4 94.8
96.3 96.2 95.4 96.8
94.7 97.3 96.7 97.2
95.6 96.8 95.9 97.0


a. Calculate 95 percent confidence limits on the difference between the two
percent completeness means.
b. From which supplier would you recommend the manufacturer to buy alkyl-
benzene? Why?


5.9.5 a. Using the following sample data, find a 99 percent confidence interval for the
population mean.

y: 13, 4, 12, 8, 7, 14, 15, 13, 6

b. Consider the following data as a sample taken from a second population. Find
a 99 percent confidence interval for the population mean.

y: 10, 9, 10, 6, 11, 13, 9, 2, 8

c. Give an estimate of the common variance of the two populations considered
above.
d. Determine a 99 percent confidence interval for the difference between the
two population means.
5.9.6 All accounting information is available from two plants. From plant A, 10
random accounts were selected and checked for percent of errors. From plant B,
eight accounts were chosen at random. The results are as follows:


Plant A Plant B
12.4 12.1 13.6 12.5
10.6 12.0 12.4 13.6
11.8 12.1 11.8 12.8
12.4 11.8 11.9 13.2


a. Would you consider these plants different? Why? 
b. Why do you suppose only eight were taken from plant B? 




5.9.7 An analytic study reports the following data on measurement of a certain
characteristic in treated and untreated water from a given basic supply, 10 water
samples having been used in each case.


                   Treated   Untreated
Sample size           10        10
Sample mean         92.46     98.16
Sample variance     36.00     45.00


The characteristic under study is one
which is generally regarded as being normally distributed. From the context of
the report it appears that the two sets of data were independent random
samples. Using 95 percent confidence limits, determine whether the true
“treated” mean is significantly smaller than the “untreated” mean. (Show clearly
all steps in your decision procedure.)


5.10 A CONFIDENCE INTERVAL FOR THE 
BINOMIAL PROPORTION p 


In Chapter 4 we considered the chance process in which there are only two 
possible outcomes, such as “success” and “failure.” The probability distribu- 
tion governing this process is the binomial distribution, and the random 
variable is the number of successes in n independent trials of an experiment 
wherein the probability of success in any single trial is p. 


This distribution applies to a wide variety of practical situations. Whenever
the characteristic we want to study in a population is of the two-category kind, 
we have to deal with the binomial distribution. Such characteristics are very 
frequently the objects of important investigations: (a) voter opinion in a 
referendum (for, against), (b) status of a manufactured item produced under 
quality control (acceptable, not acceptable), (c) condition of a cancer patient 5
years after diagnosis (alive, dead), (d) sex of high-school teacher (male,
female).


For any such characteristic, the experiment of drawing at random a member 
of a population and observing to which of the two categories he (she, or it)
belongs is conceptually the same as tossing a coin and observing head or tail in 
the result. Head can designate one of the two categories of the characteristic 
and tail, the other category. In general the coin is biased, having p as the 
probability of tossing head (the "success" of our earlier discussion). 


In any such situation our interest centers on the value of p, the probability 
that a single trial will give head. In most practical applications, we want to
interpret this probability in its long-run sense—the proportion of an unending 
sequence of tosses that will be heads. If a finite population is then large 
enough, we can look on p as the proportion of the population that belongs to 




the category head: (a) the proportion of voters who are in favor of the 
referendum proposition, (b) the proportion of acceptable items coming off the 
production line (industrial quality control often looks at this from the opposite 
side, "percent defective"), (c) the 5-year survival rate of cancer patients, (d) 
the percentage of male high-school teachers. 


When we have a sample of n members drawn from a population, we check 
out each member of the sample, determining which of the two categories 
applies, tally the number of members in the “head” category, divide this 
number by n, and thus arrive at the sample proportion for “head.” This sample 
proportion we shall designate $\hat{p}$. (There are various ways of speaking the
symbol $\hat{p}$: “p circumflex” would be precise but is never heard; “p hat” is the
common expression in this country, with “p roof” being the favorite in certain
other regions.)


Statistical estimation of p proceeds in the same manner as estimation of a
mean. It is required that the sample be a random sample in the same sense we
discussed earlier. It is only to the $\hat{p}$ of such a sample that we can apply the
methods of inference which we shall discuss. Hence we define $\hat{p}$ in terms of a
random sample.


Given a random sample of n observations from a population, the
sample proportion $\hat{p}$ is the observed fraction

$$ \hat{p} = \frac{\text{number of sample elements in category “head”}}{n}. \tag{5.10.1} $$


Example 5.10.1 

A random sample of 200 women from a large population of women operated
on for removal of breast cancer was observed over a period of time covering 5
years after each woman’s operation. During the period none of the women
died of any cause other than cancer. Ten of the women died of cancer within 5
years after the operation. The observed sample 5-year survival rate for surgi-
cally treated breast cancer is then

$$ \hat{p} = \frac{200 - 10}{200} = \frac{190}{200} = .95. $$

This proportion, as in the case of any such fraction, can be expressed as 95
percent, or 95 per hundred, or 950 per 1000, or in any other form adopted as
conventional usage in a specific field of study.




Since a random sample is a chance process, the number of sample elements in
category “head” is a random variable. Hence so is the sample proportion $\hat{p}$.
When we draw one random sample, we get one observation on this random
variable: the observed sample proportion $\hat{p}$. In order to make statistical
inferences from such an observed value, we need to know the probability
behavior of $\hat{p}$ from sample to sample, that is, the probability distribution of $\hat{p}$.


Dealing with the exact probability distribution of $\hat{p}$ is beyond the scope of
this book. Fortunately there is an approximate distribution that will give us
satisfactory results in most practical situations. We can arrive at this by the
theory that we have already applied to the sample mean $\bar{Y}$.


The sample proportion $\hat{p}$ is a sample mean. In our two-category situation,
the population random variable Y is the binomial random variable with
parameter p and n = 1. Remember that a population random variable is the
random variable that applies to any single member chosen at random from the
population. So in the present case we have Y as the binomial random variable
for a single trial. Y is thus the number of “successes” (or “heads”) in one trial.
Obviously the only possible values for Y are 1 and 0; in a single trial we can
only get one head or no head. Now when we have a random sample of size n,
we have the n random variables $Y_1, Y_2, \ldots, Y_n$. Our observations give us the
collection $y_1, y_2, \ldots, y_n$, wherein each $y_i$ is either 1 or 0: 1 if the outcome is
“head,” 0 if the outcome is “tail.” When we add the y values, the total is precisely
the number of “heads” in the sample. Hence the sample mean $\bar{Y}$ is

$$ \bar{Y} = \frac{\sum y}{n} = \frac{\text{number of sample elements in category “head”}}{n} = \hat{p}. $$
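
The identity between the sample proportion and the mean of 0-and-1 observations is easy to see in a small Python sketch. The coding of the observations below (190 ones and 10 zeros) simply mirrors the counts of Example 5.10.1; the variable names are our own.

```python
# 1 codes "head" (survived 5 years), 0 codes "tail";
# the counts follow Example 5.10.1.
observations = [1] * 190 + [0] * 10

# Sample proportion as defined in (5.10.1): count of "heads" divided by n.
p_hat_as_fraction = observations.count(1) / len(observations)

# The ordinary sample mean of the same 0/1 values.
p_hat_as_mean = sum(observations) / len(observations)

print(p_hat_as_fraction, p_hat_as_mean)  # 0.95 0.95
```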


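Making this interpretation of $\hat{p}$ as a sample mean $\bar{Y}$, we can use all of the
facts in Section 5.4.
First we refer to (4.8.5) and find that, since our present Y is binomial with
n = 1,

$$ \mu = (1)p = p, \qquad \sigma = \sqrt{(1)pq} = \sqrt{pq}, \quad \text{where } q = 1 - p. $$

Using these values in (5.4.1), we obtain

$$ \mu_{\bar{Y}} = p, \qquad \sigma_{\bar{Y}} = \frac{\sqrt{pq}}{\sqrt{n}} = \sqrt{\frac{pq}{n}}. $$

But the $\bar{Y}$ here is our sample proportion $\hat{p}$. Thus we have the following
important facts about $\hat{p}$.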




If $\hat{p}$ is the proportion of “heads” in a random sample of size n drawn from a
population in which the probability of a member's being “head” is p, then the
probability distribution of $\hat{p}$ has the following mean and standard deviation:

$$ \text{Mean of } \hat{p}: \quad \mu_{\hat{p}} = p. $$
$$ \text{Standard deviation of } \hat{p}: \quad \sigma_{\hat{p}} = \sqrt{\frac{pq}{n}}, \quad \text{where } q = 1 - p. \tag{5.10.2} $$
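
The two facts in (5.10.2) can be checked numerically for any particular n and p by working directly from the binomial probabilities. The following Python sketch does so; the function name and the choice n = 20, p = 0.3 are ours, made only for illustration.

```python
import math


def exact_mean_and_sd_of_p_hat(n, p):
    """Mean and standard deviation of p_hat = (number of heads) / n,
    computed term by term from the binomial probabilities."""
    q = 1 - p
    pmf = [math.comb(n, k) * p**k * q**(n - k) for k in range(n + 1)]
    mean = sum((k / n) * prob for k, prob in enumerate(pmf))
    var = sum((k / n - mean) ** 2 * prob for k, prob in enumerate(pmf))
    return mean, math.sqrt(var)


n, p = 20, 0.3  # illustrative values, not taken from the text
mean, sd = exact_mean_and_sd_of_p_hat(n, p)
# The last two printed numbers should agree: exact sd versus sqrt(pq/n).
print(mean, sd, math.sqrt(p * (1 - p) / n))
```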


Next we can apply the central limit theorem (Theorem 5.4.2) since $\hat{p}$ is a $\bar{Y}$.
That gives us the information we need for making an approximation to the
probability distribution of $\hat{p}$. If $\hat{p}$ is the sample proportion referred to in
(5.10.2), then as n increases without bound, the distribution of $\hat{p}$ approaches
the normal distribution with mean p and standard deviation $\sqrt{pq/n}$.


How fast the distribution of $\hat{p}$ approaches the normal distribution as n
increases depends on the value of p. The approach is fastest when p = 1/2; it is
very slow when p is near 0 or 1. It has been found that the normal distribution
is a satisfactory approximation to the distribution of $\hat{p}$ if np > 5 and nq > 5.


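The above discussion gives us the operational rule:

$$ \frac{\hat{p} - p}{\sqrt{\dfrac{pq}{n}}} \doteq Z \quad \text{if } np > 5 \text{ and } nq > 5, \text{ where } q = 1 - p. \tag{5.10.3} $$

Just as the operational rule (5.4.2) led to the confidence interval (5.5.2) for
$\mu$, the same logic will now give us the confidence interval

$$ \hat{p} - z_* \sqrt{\frac{pq}{n}} < p < \hat{p} + z_* \sqrt{\frac{pq}{n}}. \tag{5.10.4} $$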


But now a new difficulty has appeared: the unknown p is not only in the
interior of the interval, where we want it, but also in the limits, where we
definitely do not want it. The limits must be expressible completely in terms of
$z_*$ and observed data or else we cannot calculate them.

We can apply some algebraic manipulations to the above interval and end up
with interval limits involving only $z_*$, $\hat{p}$, and n. Those limits will be fairly
complicated expressions, and there is a question whether the approximation we
are using justifies the refined effort. The procedure that is customarily followed
is to settle for an additional approximation by using $\hat{p}$ in place of p in the
standard deviation $\sqrt{pq/n}$. We thus decide on the following statement of a
confidence interval for p.




Given a random sample of n observations on a population in which
the probability of a member's being “head” is p, an approximate c
percent confidence interval for p is

$$ \hat{p} - z_* \sqrt{\frac{\hat{p}\hat{q}}{n}} < p < \hat{p} + z_* \sqrt{\frac{\hat{p}\hat{q}}{n}}, \tag{5.10.5} $$

where n = the number of observations in the sample;
$\hat{p}$ = the proportion of “heads” in the sample, $\hat{q} = 1 - \hat{p}$;
$z_*$ = the confidence-limit value of z, determined by

$$ P(-z_* < Z < z_*) = c \text{ percent}, $$

and the approximation is satisfactory if $n\hat{p}$ and $n\hat{q}$ are both
greater than 5.


As in other cases of sample means, the standard deviation $\sqrt{pq/n}$ is usually
referred to as the standard error of the (sample) proportion, and the estimated
standard deviation $\sqrt{\hat{p}\hat{q}/n}$ is called the estimated standard error of the (sample)
proportion.

Example 5.10.2 


What is (approximately) a 99 percent confidence interval for the 5-year 
survival rate in the population for which Example 5.10.1 gave a sample 
estimate? 


The data in Example 5.10.1 gave n = 200 and $\hat{p}$ = .95. We note that
$n\hat{p}$ = 200(.95) = 190, $n\hat{q}$ = 200(.05) = 10; since both of these values are greater
than 5, we judge (5.10.5) to be an acceptable approximation. For 99 percent
confidence, $z_*$ = 2.58 ($z_{.005}$ in Table A-3). Thus the limits of an approximate 99
percent confidence interval for p are

$$ .95 \pm 2.58 \sqrt{\frac{(.95)(.05)}{200}} = .95 \pm 2.58 \sqrt{\frac{.0475}{200}} $$
$$ = .95 \pm 2.58\sqrt{.000238} $$
$$ = .95 \pm 2.58(.0154) $$
$$ = .95 \pm .0397, $$

so that an approximate 99 percent confidence interval for p is

$$ .9103 < p < .9897, $$

or, in percent notation,

$$ 91.0 \text{ percent} < p < 99.0 \text{ percent}. $$




It is of interest to report that the more advanced methods which use the exact
distribution of $\hat{p}$ would here give us


89.7 percent < p < 98.2 percent. 
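
The approximate interval (5.10.5) is simple enough to put into a few lines of Python. The sketch below is an illustration under our own naming; the z value is passed in as `z_star`, as read from Table A-3, and the function refuses to proceed when the rule of thumb about $n\hat{p}$ and $n\hat{q}$ is not met.

```python
import math


def proportion_interval(heads, n, z_star):
    """Approximate confidence interval (5.10.5) for a binomial proportion p.

    heads is the number of sample elements in category "head";
    z_star is the tabled confidence-limit value of z.
    """
    p_hat = heads / n
    q_hat = 1 - p_hat
    if n * p_hat <= 5 or n * q_hat <= 5:
        raise ValueError("normal approximation doubtful: need n*p_hat and n*q_hat > 5")
    margin = z_star * math.sqrt(p_hat * q_hat / n)  # z times est. standard error
    return p_hat - margin, p_hat + margin


# Example 5.10.2: 190 survivors among 200 women, 99 percent confidence.
low, high = proportion_interval(190, 200, z_star=2.58)
print(f"{low:.3f} < p < {high:.3f}")  # 0.910 < p < 0.990
```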


5.11 REQUIRED SAMPLE SIZE FOR ESTIMATING 
A PROPORTION p 


In Section 5.6 we considered how to choose a sample size so that we could
have a specified degree of confidence that the resulting sample mean $\bar{y}$ will be
within a specified distance d of the population mean $\mu$. The same question of
what sample size to use comes up in the case where we are estimating a
population proportion p. The same argument applies; we want to specify that
the come-and-go in a confidence interval shall not exceed d.


The come-and-go in the interval (5.10.5) is of no use ahead of time since it
involves $\hat{p}$, which we cannot know until after observing the sample. The come-
and-go in the interval (5.10.4) gives us a logical rule:

$$ z_* \sqrt{\frac{pq}{n}} \le d, \tag{5.11.1} $$

but this involves p, the very value that we are trying to estimate. If we really
could assume a value for p, there would not be any reason to bother with
estimation at all.


We do have one mathematical fact to help us. Because both p and q are 
between 0 and 1, there is a limit to the value that pq can have under any 
circumstances. If we use this maximum value in (5.11.1) we can be assured that 
the resulting value of n is large enough regardless of the true value of p. In 
actual situations we may be able to do somewhat better. Our procedure goes
according to the following argument.


The graph of the quantity pq as a function of p is shown in the diagram
(overleaf). Here we see that pq never exceeds 1/4, and that that value occurs
when p = 1/2. Hence if we use pq = 1/4 in (5.11.1) we are bound to get a value
of n large enough to suit our requirement. [Recall the effect of the standard 
deviation in (5.6.1').] As a matter of fact, the value of n will almost always be 
larger than necessary, for the graph shows that the value of pq decreases away 
from 1/4 as soon as p moves away from 1/2. 




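[Figure: graph of pq (vertical axis) as a function of p (horizontal axis, marked
at 0, 0.5, and 1.0); the curve reaches its maximum height 0.25 at p = 0.5.]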


In most practical situations we have some idea of at least a bracket for p. A
reasonable procedure then is to take this bracket for p on the p axis in the
above diagram and use the largest value of pq that occurs above the bracketed
p interval. The following are some representative examples.

a. $0 \le p \le .4$: pq rises steadily from 0 over this p interval, reaching its
maximum when p = .4; that maximum is pq = (.4)(.6) = .24.

b. $.4 \le p \le .7$: pq both rises and falls over this interval; its maximum is the
overall maximum for pq: 1/4.

c. $.7 \le p \le 1$: pq is decreasing all the way from where p = .7 to where p = 1,
so that its maximum occurs where p = .7, that maximum being pq =
(.7)(.3) = .21.
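
The rule illustrated in (a), (b), and (c) can be stated compactly: if the bracket contains 1/2, use 1/4; otherwise use the value of pq at the end of the bracket nearer to 1/2. A short Python sketch of that rule follows; the function name `max_pq` is our own.

```python
def max_pq(low, high):
    """Largest value of p(1 - p) for p in the bracket low <= p <= high."""
    if low <= 0.5 <= high:
        return 0.25
    # pq rises until p = 1/2 and falls afterwards, so the maximum sits at
    # whichever end of the bracket lies nearer to 1/2.
    nearest = high if high < 0.5 else low
    return nearest * (1 - nearest)


for bracket in [(0.0, 0.4), (0.4, 0.7), (0.7, 1.0)]:
    print(bracket, round(max_pq(*bracket), 2))
```

The three brackets printed are those of examples (a), (b), and (c), giving .24, .25, and .21 respectively.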

Example 5.11.1 


An investigator interested in the 5-year survival rate of women who had had
breast-cancer operations is unsatisfied with the width of the 99 percent
confidence interval in Example 5.10.2 (.91 < p < .99). He would like to esti-
mate p to within .02 with 99 percent confidence. His experience suggests that p
is at least .9. How large a sample is required to meet his estimation specifica-
tions?


We see that the maximum value of pq over the interval $.9 \le p \le 1$ is the
value at p = .9: pq = (.9)(.1) = .09. Using this value in (5.11.1), we have

$$ 2.58 \sqrt{\frac{.09}{n}} \le .02, $$
$$ \frac{2.58\sqrt{.09}}{\sqrt{n}} \le .02, $$
$$ 2.58\sqrt{.09} \le .02\sqrt{n}, $$
$$ 129\sqrt{.09} \le \sqrt{n}, $$
$$ 16641(.09) \le n, $$
$$ 1497.69 \le n, $$

giving 1498 as the minimum required sample size.
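
Solving (5.11.1) for n gives $n \ge z_*^2\, pq / d^2$, and the required sample size is this quantity rounded up to a whole number. The Python sketch below carries out that step; the function name and the default `max_pq=0.25` (the worst case, p = 1/2) are our own choices for illustration.

```python
import math


def required_sample_size(z_star, d, max_pq=0.25):
    """Smallest n satisfying z_star * sqrt(max_pq / n) <= d, as in (5.11.1)."""
    return math.ceil(z_star**2 * max_pq / d**2)


# Example 5.11.1: 99 percent confidence, d = .02, p believed to be at least .9.
print(required_sample_size(2.58, 0.02, max_pq=0.09))  # 1498

# With no prior bracket for p, the worst case pq = 1/4 must be used.
print(required_sample_size(2.58, 0.02))  # 4161
```

The comparison shows how much a modest amount of prior knowledge about p can reduce the sample that must be taken.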




5.12 A CONFIDENCE INTERVAL FOR THE DIFFERENCE
BETWEEN TWO POPULATION PROPORTIONS, $p_1 - p_2$


As in the case of the quantitative characteristics we studied in Sections 5.8 
and 5.9, there is often practical interest in the difference between two propor- 
tions: (a) difference between men and women as to proportion favoring a 
referendum proposition, (b) difference in percent defective between the out- 
puts of two different machines, (c) difference in 5-year survival rates of patients 
given two different kinds of operation, (d) difference in percentages of men 
on high school faculties in public and private schools. 


In Section 5.10 we arrived at the probability distribution of a sample
proportion $\hat{p}$ by looking on $\hat{p}$ as a sample mean $\bar{Y}$. So now, with independent
samples from two populations, we can treat $\hat{p}_1 - \hat{p}_2$ as a difference between
sample means, and then apply what we had in Section 5.8. The results are as
follows, matching the facts (1), (2), and (3) in Section 5.8.


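1. The mean of $\hat{p}_1 - \hat{p}_2$ is $p_1 - p_2$:

$$ \mu_{\hat{p}_1 - \hat{p}_2} = p_1 - p_2. \tag{5.12.1} $$

2. If the two samples are drawn independently, the standard deviation
(standard error) of $\hat{p}_1 - \hat{p}_2$ is the square root of the sum of the variances of
$\hat{p}_1$ and $\hat{p}_2$, these variances being given by (5.10.2):

$$ \sigma_{\hat{p}_1 - \hat{p}_2} = \sqrt{\frac{p_1 q_1}{n_1} + \frac{p_2 q_2}{n_2}} \quad \text{if } \hat{p}_1 \text{ and } \hat{p}_2 \text{ are independent.} \tag{5.12.2} $$

3. The distribution of $\hat{p}_1 - \hat{p}_2$ approaches the normal distribution as $n_1$ and
$n_2$ both go to infinity.
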
As in the case of a single proportion, the normal approximation suggested in
(3) is generally satisfactory if $n_1 p_1$, $n_1 q_1$, $n_2 p_2$, $n_2 q_2$ are all greater than 5. We
are then led to the following version of (5.8.3) applied to sample proportions:

$$ \frac{(\hat{p}_1 - \hat{p}_2) - (p_1 - p_2)}{\sqrt{\dfrac{p_1 q_1}{n_1} + \dfrac{p_2 q_2}{n_2}}} \doteq Z \quad \text{if } n_1 p_1,\ n_1 q_1,\ n_2 p_2,\ n_2 q_2 \text{ are all greater than 5.} \tag{5.12.3} $$




There is no theory like that of the t distribution to give us a way of amending
(5.12.3) as we amended (5.8.3) when we worked in Section 5.9. Hence in order
to obtain a confidence interval for $p_1 - p_2$, we are forced to proceed as we did in
Section 5.10, to admit an additional approximation by using $\hat{p}_1$ and $\hat{p}_2$ for $p_1$
and $p_2$ in the denominator of (5.12.3). We thus decide on the following
statement of a confidence interval for $p_1 - p_2$:


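Given independent random samples from two populations having binomial
proportions $p_1$ and $p_2$, respectively, the respective sample sizes being $n_1$ and $n_2$,
an approximate c percent confidence interval for $p_1 - p_2$ is

$$ (\hat{p}_1 - \hat{p}_2) - z_* \sqrt{\frac{\hat{p}_1 \hat{q}_1}{n_1} + \frac{\hat{p}_2 \hat{q}_2}{n_2}} < p_1 - p_2 < (\hat{p}_1 - \hat{p}_2) + z_* \sqrt{\frac{\hat{p}_1 \hat{q}_1}{n_1} + \frac{\hat{p}_2 \hat{q}_2}{n_2}}, $$

where $\hat{p}_1$ = the proportion of “heads” in sample 1,
$\hat{p}_2$ = the proportion of “heads” in sample 2,
$\hat{q}_1 = 1 - \hat{p}_1$, $\hat{q}_2 = 1 - \hat{p}_2$,
$z_*$ = the confidence-limit value of z, determined by
$P(-z_* < Z < z_*) = c$ percent,

and the approximation is satisfactory if $n_1\hat{p}_1$, $n_1\hat{q}_1$, $n_2\hat{p}_2$, $n_2\hat{q}_2$ are all greater
than 5.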


Example 5.12.1 


The marketing organization for product T asked each of 70 housewives in a 
random sample from city 1 whether she had ever tried product T; 49 replied 
"yes." In a random sample of 120 housewives in city 2, 48 “yes” replies were 
given to the same question. What is an approximate 95 percent confidence 
interval for the difference between the two cities with respect to the percentage 
of housewives who have tried product T? 


We let p be the proportion of housewives in a city who have tried product T.
Then we have

$$ \hat{p}_1 = \frac{49}{70} = .7, \qquad \hat{p}_2 = \frac{48}{120} = .4. $$

The estimated standard error of the difference $\hat{p}_1 - \hat{p}_2$ is

$$ \sqrt{\frac{(.7)(.3)}{70} + \frac{(.4)(.6)}{120}} = \sqrt{\frac{.21}{70} + \frac{.24}{120}} = \sqrt{.003 + .002} = \sqrt{.005} = .0707. $$

The confidence-limit value $z_*$ for 95 percent confidence is 1.96, since

$$ P(-1.96 < Z < 1.96) = .95, $$

and so the limits of an approximate 95 percent confidence interval for $p_1 - p_2$
are

$$ (.7 - .4) \pm 1.96(.0707) = .3 \pm .139, $$

from which the approximate 95 percent confidence interval for $p_1 - p_2$ is

$$ .161 < p_1 - p_2 < .439. $$

Expressed in terms of percentage, the interval is

$$ 16.1 \text{ percent} < p_1 - p_2 < 43.9 \text{ percent}, $$


and we could report that the sampling indicates a higher percentage of 
housewives in city 1 have tried product T, and, with about 95 percent 
confidence, we think that the difference in percentage between the two cities is 
somewhere between 16 and 44. 
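
The steps of Example 5.12.1 can be retraced in Python. As before, this is a sketch under our own naming, with the z value supplied from the table rather than computed.

```python
import math


def difference_of_proportions_interval(x1, n1, x2, n2, z_star):
    """Approximate confidence interval for p1 - p2 from two independent samples.

    x1, x2 are the counts of "heads" in samples of sizes n1, n2;
    z_star is the tabled confidence-limit value of z.
    """
    p1_hat, p2_hat = x1 / n1, x2 / n2
    # Estimated standard error of the difference of sample proportions.
    std_err = math.sqrt(p1_hat * (1 - p1_hat) / n1 + p2_hat * (1 - p2_hat) / n2)
    diff = p1_hat - p2_hat
    return diff - z_star * std_err, diff + z_star * std_err


# Example 5.12.1: 49 of 70 in city 1, 48 of 120 in city 2, 95 percent confidence.
low, high = difference_of_proportions_interval(49, 70, 48, 120, z_star=1.96)
print(f"{low:.3f} < p1 - p2 < {high:.3f}")  # 0.161 < p1 - p2 < 0.439
```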


EXERCISES 


5.12.1 A weights-and-measures inspector for consumer products went into a local
grocery store to check on the weight of synthetic detergents. She walked to
the shelf containing “Ebb” detergent. Each carton was cited as containing 20
ounces of Ebb. The inspector chose n = 60 cartons, emptied the contents of
each carton onto scales, and read the weight. She found six cartons whose
contents were under 20 ounces; the rest were over 20 ounces.

a. What is the estimate of the true percent under the marked weight of 20
ounces?

b. Place 95 percent confidence limits on the true percent under marked weight.

c. Discuss the implications of using this sample as indicative of all Ebb
20-ounce cartons.

d. What additional information available would help in assessing the percen-
tage of a year’s supply under weight?


5.12.2 In a random sample of 100 of a certain kind of seed there were 20 seeds that
germinated. Give a 95 percent confidence interval for the number of seeds
that will germinate if 400 are planted.


5.12.3 A certain treatment is found effective in 16 out of 25 cases. Construct an
approximate 99 percent confidence interval for the probability that the
treatment is effective in a single case.

5.12.4 In a random sample of 100 articles produced in a certain process, 10 are 
found to have defects. Construct a 95 percent confidence interval for the 
defect rate of the process. 




5.12.5 To estimate the proportion of a population in favor of a certain proposal, a
statistician questioned a random sample of 200 and found 80 favorable
replies. Give a 99 percent confidence interval for the true population propor-
tion p in favor of the proposal.


5.12.6 In a certain achievement test 45 students out of 600 in district A and 39 out
of 800 in district B received scores in the “superior” category. Determine a 95
percent confidence interval for the difference between “superior” proportion
in the two districts.


5.12.7 In Trenton, New Jersey 148 men and 152 women were polled on the
question “Do you approve, by and large, of the practice of tipping?” Eighty-
nine men and 116 women replied “yes.” Determine a 90 percent confidence
interval for the difference between the true approval rates of the two sexes.
What inferences can you draw from this?


5.12.8 A local grocery chain received a truck load of 5000 filled peanut-butter jars.
In unloading the truck, several of the jars were found to have loose lids. If this
were true of most of the jars, there would be a serious problem of spoiled
peanut butter. Thus the manager decided to take a random sample of 200 jars
and test them for loose lids. The test showed that six of the jars had loose lids.
a. What is your best estimate of the percentage of loose lids in the shipment?
b. Place 95 percent confidence limits on the true percentage of loose lids.
c. Discuss how one would select 200 jars of peanut butter from the truckload
of jars. Do you foresee any practical difficulties in doing this?


5.12.9 What sample size will be surely large enough so that we can be 99 percent
confident that the resulting $\hat{p}$ will not differ from the true proportion p by
more than .025? How does your answer change if you can assume that p is
certainly no larger than 0.1?


5.12.10 A shipment of 1200 flashlights was received from Hong Kong. The flashlights
were to be used as prizes in a contest. The contest manager decided he'd
better try a few to see if they would light. He chose a sample of n = 100
flashlights and tested them.

a. How would you choose the 100 flashlights? Illustrate the procedure by
selecting the first 10 flashlights.

b. Assume that seven out of the 100 would not light. What is your estimate of
the number of defective flashlights in the shipment of 1200?

c. Place 95 percent confidence limits on the true percent defective.

d. What are the 95 percent confidence limits on the number of defective
flashlights in the shipment?




5.12.11 A large accounting firm handled the billing for several small companies. Last
year a total of 8000 bills were processed by the firm for the Handy Company.
After receiving several complaints about billing errors, the Handy Company
decided to take an audit on the bills (invoices). The auditor took a random
sample of 400 invoices and found eight billing errors.

a. What is your estimate of the billing-error rate?

b. Place 90 percent confidence limits on the true billing-error rate.

c. How many bills would you estimate to have errors?

d. What would you tell the accounting firm if you were president of the
Handy Company?


To Reject Or
Not To Reject


6.1 THE ROLE OF STATISTICS IN THE SCIENTIFIC METHOD


The procedure of investigation that has come to be known as “the scientific
method” is a procedure of constructing a hypothesis about a condition or
process or law of nature and then checking it against reality by means of
observations. One forms his hypothesis, makes observations related to it in the
real world, compares the results with what the hypothesis hypothesizes, and
then accepts or rejects the hypothesis according as to whether the observed
results match the hypothesis.

One of the first dramatic examples we learned about in childhood was
Columbus’s test of the hypothesis that the earth is a sphere (“the world is
round”). By sailing west from Spain he would, according to the hypothesis,
eventually reach the Far East. Confirming or denying the hypothesis was
delayed for a time since he bumped into the Americas on the way. The
movement of the earth around the sun, rather than vice-versa, was for man at
first a hypothesis; so was the pattern of circulation of the blood in the human
body, the effectiveness of vaccination, the action of penicillin, and Einstein's
famous equation $E = mc^2$.

[Cartoon: “Peanuts” strip about a baseball team's statistics; captions include
“Both our hitting and our fielding averages were down this year...,” “So you all
know what we have to do next season,” and “Get a new statistician!!!”]

When a hypothesis about a process in Nature has to do with one of Nature's
chance processes, it is generally not a straightforward matter to check the
hypothesis against the observations. In any chance process the various possible
observations will occur according to the probability distribution of the process.
There is natural variability among such observations, the different outcomes
happening in the various proportions specified by the probability law. Thus
when we use a sample of observations to test a hypothesis, we have to face the
fact that the observations will not fit the hypothesis perfectly even if the
hypothesis is true.


Looking at the discrepancy between hypothesis and observations, the inves- 
tigator has to sort out the difference that can be attributed to chance and the 
difference that must be charged to the falsity of the hypothesis. As in other 
studies involving chance processes, we can never know the truth; all we can do
is make decisions according to procedures which have good odds of producing 
correct decisions. It is in such decision-making that statistical inference has a 
role to play. 


Recall the example in Chapter 4 where we calculated the probability of
rolling “7” five times in succession with a pair of fair dice. That probability is
(1/6)^5 = .0001286, indicating that in the long run such an all-7 quintuple of
rolls will occur about 13 times out of 100,000, on the average. As specified,
this probability is based on the assumption that the dice and rolls are “fair.”
Now suppose that you pick up a new pair of dice and at once roll 7, 7, 7, 7, 7. 
If we can assume our rolls to be a random sample of all possible rolls, what are 
we to think of our remarkable result? There are logically two alternative 
conclusions: (a) the dice are fair dice and an extremely rare event has occurred 
or (b) the dice are not fair dice, having instead a probability distribution in 
which the occurrence of five successive 7s is not a rare event. 
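
The arithmetic behind the dice example can be checked in a few lines. The sketch below is an added illustration, not part of the original example; it assumes fair dice and independent rolls, and the variable names are hypothetical.

```python
from fractions import Fraction

# Probability of a total of 7 on one roll of two fair dice:
# 6 favorable outcomes out of 36 equally likely ones.
p_seven = Fraction(6, 36)

# Five independent rolls, every one of them a 7.
p_five_sevens = p_seven ** 5

print(p_five_sevens)                   # 1/7776
print(round(float(p_five_sevens), 7))  # 0.0001286
```

A probability this small under the "fair dice" assumption is what makes conclusion (b) the more reasonable choice.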


Our curiosity would no doubt compel us to suspend judgment and make a lot 
more rolls. But suppose that for some reason we had to choose between (a) 
and (b) after the five rolls. The natural reaction to (a) is that commonly used 
expression, “It’s possible but not probable." It does not seem reasonable that 
the outcome of a single sample from a population should be one of the 
outcomes that had a very small chance of occurring. It can happen but it is 
very unlikely. With such a point of view, we would choose (b) as our 
conclusion; that is, we would reject the hypothesis that the dice are fair. We 
would grant that there is a risk of our being wrong, but we would judge the risk 
to be very small. 


This is the rationale of the statistical test of hypothesis or test of statistical 
significance. In practice, the word “statistical” is generally omitted from both
of these expressions, but its presence should always be clearly understood, 
because the test is a procedure of statistical inference, with conclusions phrased 
in terms of probability. 


6.2 THE LEVEL OF SIGNIFICANCE 


In order to control the risk of rejecting the hypothesis when it is in fact true, 
we choose ahead of time the probability boundary below which we are going to 
call an experimental outcome “possible but not probable." Commonly used 
values are 5 and 1 percent. Such a probability value, chosen in advance, gives 
us our definition of how rare an experimental result will have to be in order to 
make us consider it rare beyond the credibility of chance. 


In order to fix ideas by an example free of mathematically technical difficul- 
ties, consider the following situation. In a certain manufacturing process, the 
production line includes a weighing machine set to deliver 12 ounces of the 
manufactured material into a cardboard box. Boxes thus filled are produced in 
large quantity and marketed with a label declaring “contents: 12 ounces." 
Long experience has shown that the weighing machine does not deliver 
precisely 12 ounces to each and every box, but that the amount delivered is a 
random variable that is normally distributed, with standard deviation small and
very stable over time (σ = 0.1) but the mean (μ) subject to shifts away from the
preset 12 ounces. The quality control section of the company makes a periodic
check by taking a random sample of 25 filled boxes and weighing the contents
on an elaborately calibrated scale.


The question is: How far below 12 must the mean ȳ of the sample of 25
boxes be in order to justify a decision that the mean μ of the production line
weighing machine is less than 12? The company hypothesizes that μ = 12,
against the alternative hypothesis that μ < 12. It has become customary to refer
to the basic hypothesis as the null hypothesis and to use for it the notation Ho,
suggesting the idea of “no difference,” in our case no difference between the
process mean μ and the specification value 12 (ounces). We can designate
the alternative hypothesis by Ha, and thus label our test as follows:


Ho: μ = 12 versus Ha: μ < 12.


The company wants to run only a small risk of rejecting Ho (and thus
accepting Ha) if Ho is in fact true. For the sake of example we suppose the
limit of tolerable risk to be 5 percent. We then argue that the ȳ values that
justify rejecting Ho are those far enough below 12 as to have no more than a 5
percent chance of occurring if μ is in fact 12. Any such ȳ value will be
considered as “significantly” less than 12, the term significant being construed
in a technical statistical sense as meaning “beyond reasonable attribution to
chance.” It is in relation to acting as the criterion for this judgment that the 5
percent is called the level of significance. A common general notation is the
lower-case Greek letter “alpha,” so that we speak of α = .05 as being the same
as the 5 percent level of significance.




6.3 THE CRITICAL REGION 


The ȳ values far enough below 12 to cause us to reject the null hypothesis
that μ is 12 make up a set of values called the critical region of the test. The
values are “critical” in the sense that if the sample produces any one of them it
causes Ho to be rejected. The probability α associated with the set of critical
values is referred to as the size of the critical region; it is “size” in the sense of
probability mass.


According to Theorem 5.4.1, the ȳ's of samples of size 25 in our present
situation will be distributed in a normal probability pattern having the
population mean μ as its mean and the standard deviation σ/√n, specifically
(0.1/√25) = (0.1/5) = 0.02. That is, Ȳ is normal with mean μ and standard
deviation 0.02. Now if Ho is true, then μ = 12 and hence Ȳ is normal with
mean 12 and standard deviation 0.02.


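We can then determine the critical region specifically, as follows.

[Figure: standard normal density, with the lower-tail area α = 0.05 lying to
the left of z = -1.645.]

$$\alpha = 0.05 = P(Z \le -1.645) = P\left(\frac{\bar{Y} - 12}{0.02} \le -1.645\right)$$

$$= P(\bar{Y} - 12 \le -0.0329) = P(\bar{Y} \le 11.9671).$$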


Thus if Ho is true, there is just 5 percent probability that the ȳ of a sample will
turn out to be 11.97 or less. Hence the critical region of the test is the interval
ȳ ≤ 11.97; if ȳ falls in this region we shall reject Ho.
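
The boundary of the critical region can be recomputed directly from α. The following is a minimal sketch added for illustration; it assumes Python's standard-library `statistics.NormalDist` in place of the normal table, and the variable names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

mu0 = 12.0      # mean specified by the null hypothesis
sigma = 0.1     # known standard deviation of box contents
n = 25          # sample size
alpha = 0.05    # level of significance (lower tail)

se = sigma / sqrt(n)                    # standard deviation of the sample mean
z_crit = NormalDist().inv_cdf(alpha)    # 5th percentile of the standard normal
y_crit = mu0 + z_crit * se              # boundary of the critical region for y-bar

print(round(se, 2))       # 0.02
print(round(z_crit, 3))   # -1.645
print(round(y_crit, 4))   # 11.9671
```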


We can specify the above critical region either directly in terms of ȳ or
indirectly in terms of z:

$$\bar{y} \le 11.97 \quad\text{or}\quad z \le -1.645, \quad\text{where } z = \frac{\bar{y} - 12}{\sqrt{.01/25}}.$$




The critical region is customarily stated in the manner of a decision rule:
reject Ho if ȳ ≤ 11.97; accept Ho otherwise. This rule could be stated also in
terms of z:


If Ho is true, then

$$\frac{\bar{Y} - 12}{\sqrt{.01/25}} = Z.$$

Reject Ho if z ≤ -1.645; accept Ho otherwise.


6.4 PERFORMING THE TEST 


After the test procedure has been set up in accordance with Steps 1-3 above, 
it remains only to take the sample, compute the resulting value of the test 
statistic, and make a decision in accordance with the decision rule. 


In the example used for the discussion above, an actual sample of 25 boxes
yielded ȳ = (298.71/25) = 11.948. This value falls in the critical region (ȳ ≤
11.97) and so we must reject Ho. In terms of the critical z region (z ≤ -1.645)
we would state:


From the sample data, yielding ȳ = 11.948, we have

$$z = \frac{11.948 - 12}{\sqrt{.0004}} = \frac{-0.052}{0.02} = -2.6.$$

Reject Ho.
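
The whole one-tailed z-test fits in a few lines of code. This is a sketch added for illustration, using the values from the box-weight example; the function name `z_statistic` is hypothetical.

```python
from math import sqrt

def z_statistic(y_bar, mu0, sigma, n):
    """Standardized distance of the sample mean from the hypothesized mean."""
    return (y_bar - mu0) / (sigma / sqrt(n))

z_crit = -1.645                       # lower-tail critical value for alpha = .05
z = z_statistic(11.948, 12.0, 0.1, 25)
reject = z <= z_crit                  # decision rule: reject Ho if z <= -1.645

print(round(z, 2))   # -2.6
print(reject)        # True
```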




There is a difference of opinion among experimental scientists, and among 
statisticians, as to whether accepting Ho is the logical alternative to rejecting Ho. 
The argument is that rejection is reasonable when an experimental result seems 
beyond the limits of chance, but acceptance should require much more than 
having the results of a single sample be better than improbable. If one is ruled 
by this argument, he replaces “accept Ho" by "nonreject Ho," “do not reject 
Ho," or "suspend judgment." 


In a mathematical sense, this is only a matter of semantics. The decision rule 
is a two-decision rule: reject Ho or take some other action. “Accept Ho” came
into usage as the most obvious opposite of “reject Ho,” but other terms will fit
the test procedure satisfactorily. We shall consistently use reject Ho and accept
Ho as the alternatives, but in both cases we shall extend the statement to 
include some assertion about "significance." 


The test procedure that we have outlined is called both test of hypothesis and 
test of significance. It is the same test under either name. As a test of
hypothesis, it tells us how to decide whether or not to reject a null hypothesis. 
As a test of significance, it tells us whether sample data are or are not 
significantly contrary to the null hypothesis. One's decision is most helpfully set 
forth if it is reported in both ways. Thus in our example above, we would 
report: Reject Ho; the observed sample mean box content is significantly less 
than 12 ounces, at the 5 percent level of significance. 


Notice that the null hypothesis (Ho: μ = 12) is either rejected or accepted
(nonrejected); there can be no matter of “significance” involved in the
statement of a hypothesized population value. On the other hand, a sample value
(here ȳ = 11.948) is not a hypothetical one; it has actually been observed, and
either is or is not significantly in line with Ha, the alternative to Ho. Notice also
that it is essential to state the level of significance since that is the criterion
used to define significance.


6.5 THE DESCRIPTIVE LEVEL OF SIGNIFICANCE


There are many situations in which the experimental scientist insists that
there is no reasonable way to choose a value for α, the level of significance.
Without an α value, he of course cannot set up a decision rule in the manner of
the foregoing test procedure, since he has no criterion for “significance.” Such
investigators prefer to calculate the value of the test statistic from the sample
data and then ask for the probability that a value “as excessive” as that one
would have occurred if Ho were true.


In our example of box weight we would have the following:

$$\bar{y} = 11.948, \quad z = -2.6;$$

$$P(\bar{Y} \le 11.948 \mid H_0) = P(Z \le -2.6) = 0.0047.$$

This tells us that the probability is about 0.5 percent that a sample mean as far
below 12 as the one we got would occur if in fact the mean μ were 12. It thus
argues strongly against Ho.

Such a probability is customarily labeled P and is reported so that the 
investigator or reader of the report can decide for himself whether the value is 
small enough to justify the judgment that the sample result was so unlikely, if
Ho were true, that Ho must be rejected. P is thus the smallest value of α at
which the given test statistic could be ruled significant. It is thus a kind of ex
post facto level of significance, and has been given the name descriptive level of 
significance: 


P = descriptive level of significance
  = P(test statistic value as excessive as the one observed | Ho)      (6.5.1)


Even when a customary level of significance, like 5 percent or 1 percent, is
used in a test of hypothesis, it is useful to report the P-value as additional
information about the sample.
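
The descriptive level of significance for the box-weight example can be recomputed without a table. This is a minimal added sketch, assuming the standard-library `statistics.NormalDist`; the names are hypothetical.

```python
from statistics import NormalDist

z = -2.6                                   # observed test statistic
p_value = NormalDist().cdf(z)              # P(Z <= -2.6) under Ho

print(round(p_value, 4))                   # 0.0047

# P is the smallest alpha at which the result is significant:
# the result is significant at any level alpha >= P.
for alpha in (0.05, 0.01, 0.001):
    print(alpha, p_value <= alpha)
```

With P near .0047, the result is significant at the 5 and 1 percent levels but not at the 0.1 percent level.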




6.6 ONE-TAILED AND TWO-TAILED TESTS 


In our example we had Ho: μ = 12 versus Ha: μ < 12. Here the alternative
hypothesis is a one-sided alternative to Ho: just those values less than 12 are of
interest to the test. As we saw, this led to a critical region of z values in the
left-hand tail of the standard normal distribution. In this sense the test is a
one-tailed test. Had our interest centered on values larger than 12, we would
have had Ho: μ = 12 versus Ha: μ > 12, and again the test would have been
one-tailed, this time having the critical region in the right-hand tail of the z
distribution.


When Ha is a two-sided alternative to Ho, as in the case of Ho: μ = 12
versus Ha: μ ≠ 12, there are two tails of interest in the distribution of the test
statistic, and the critical region is composed of two parts, one in each tail. In
the absence of any special considerations, the size α is divided evenly between
the two tails, α/2 for each of the two parts of the critical region.


Example 6.6.1 


To make a convenient comparison of the one-tailed and two-tailed tests, let 
us go back to our example of weight of contents in boxes marked “12 ounces," 
and now set down the complete procedure of a two-tailed test, using the same 
(5 percent) level of significance. 


Ho: μ = 12 versus Ha: μ ≠ 12. Y is N(μ, 0.1); random sample,
n = 25; α = .05.

If Ho is true, then

$$\frac{\bar{Y} - 12}{\sqrt{.01/25}} = Z.$$

Reject Ho if z ≤ -1.960 or if z ≥ +1.960; accept Ho otherwise.

[Figure: standard normal density, with area .025 in each tail beyond
z = -1.960 and z = +1.960.]

From the sample data (giving ȳ = 11.948) we have

$$z = \frac{11.948 - 12}{\sqrt{.01/25}} = \frac{-0.052}{\sqrt{.0004}} = \frac{-0.052}{0.02} = -2.60.$$

Reject Ho. The observed sample mean box content is significantly different
from 12 ounces, at the 5 percent level of significance.

P = P(Z ≤ -2.60 or Z ≥ +2.60) = 2(.0047) = .0094.
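
For comparison with the one-tailed case, the two-tailed decision and P-value can be sketched as follows. This is an added illustration; the helper name `two_tailed_p` is hypothetical, and `statistics.NormalDist` stands in for the normal table.

```python
from statistics import NormalDist

def two_tailed_p(z):
    """Probability of a standard normal value at least as far from 0 as z."""
    return 2 * NormalDist().cdf(-abs(z))

z = -2.60
z_crit = 1.960                 # critical value for alpha = .05 split over two tails
reject = abs(z) >= z_crit      # reject Ho if z <= -1.960 or z >= +1.960
p_value = two_tailed_p(z)

print(reject)                  # True
print(round(p_value, 4))       # 0.0093
```

The unrounded result is about .0093; the text's .0094 comes from doubling the rounded table value .0047.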




Two things about the conclusion of the test should be given special notice:


1. The statement about significance is related to the alternative hypothesis
   Ha. Here Ha specifies μ ≠ 12. Hence the “significance” concerning the
   sample mean ȳ has to do with ȳ being different from 12, whether it is
   greater or less than 12 being beside the point. The critical region treats
   both alternatives equally so that the important characteristic at issue is
   difference from 12; we must state our conclusion consistently with this.

2. The observed statistic value will be in one tail or the other, but the P
   value must take into account two tails whenever the critical region
   involves two tails of a distribution. This is the reason why the P-value is
   defined as the probability of obtaining a statistic value “as excessive as”
   the one observed; in a two-tailed test this means “as far out, in either
   direction, as the one observed.”


EXERCISES 


6.6.1 In each of the following cases, consider that there is a random variable Y
under study, that its standard deviation σ is taken as known, and that it is
desired to test a hypothesis concerning its mean μ. Set up the test of the
specified hypothesis, given the related facts as indicated.

a. μ = 50 versus μ ≠ 50, given σ = 10, n = 25, α = .05.

b. μ = 75 versus μ > 75, given σ = 8, n = 50, α = .01.

c. μ = 300 versus μ < 300, given σ = 20, n = 16, α = .05.


6.6.2 In a certain normal population σ is known to be 8.

a. Set up the test (that is, give the first three steps in the standard test
procedure) for testing Ho: μ = 50 versus Ha: μ > 50, at the 5 percent level of
significance, using sample size n = 16.

b. Notice that the critical region z ≥ z_c (where z_c is the proper critical value of z)
can be expressed in the form ȳ ≥ k (by making use of the fact stated in Step 2
of the test). Find this form in the above case, and then express your decision
rule in the form “Reject Ho if ȳ ≥ ____; accept Ho otherwise.” Now what
is the probability that you will accept Ho if the population mean μ is actually
55?


6.6.3 Suppose that with a certain intelligence test the distribution of I.Q.s is known to
be normal with standard deviation 16. Suppose also that there is a policy to
conduct a special study of the school program in any group where the mean I.Q.
is less than 90. A class of 36 students from a certain group is to be tested, and on
the basis of the results a decision is to be made as to whether the special study
should be instituted. Set up the appropriate test of hypothesis, taking significance
level .01. What does this procedure assume about the relationship between the
class and the group from which it came? What is the probability of deciding in
favor of the special study if the group mean is really 80?




6.6.4 The life of a pair of army shoes is stated to be normally distributed with mean
μ = 12 months, and standard deviation σ = 2 months. The supply sergeant for
company A in the 198th infantry regiment kept records on the life of army shoes
in his company. He had to replace 100 pairs of shoes last year. When he
calculated the average life he found it to be ȳ = 11.7 months. Using α = 5
percent, is this sufficient evidence for the sergeant to complain to the supply
officer about the quality of the shoe wear? Draw your own conclusions using
your own choice of α and state justification for any actions you recommend.


6.6.5 The diameters of certain shafts must be less than 1.500 inches to be usable. The
shafts are produced by a process that gives a normal distribution with a mean
diameter μ = 1.490 inches and a standard deviation σ = .005 inches. A
metallurgical company using these shafts registered a complaint with the supplier. The
company said that the average shaft they'd been receiving was ȳ = 1.495 inches
based on a sample of n = 49 shafts and they had to reject too many shafts. Using
α = 5 percent, do you agree with the company's complaint? Why or why not?


6.7 TESTS CONCERNING μ WHEN σ IS UNKNOWN


The logic of every statistical test of hypothesis is precisely the same as that 
discussed above. Each test has the Steps 1—3 of Section 6.3, and the conclusion 
is carried out in the manner of Section 6.4. What changes from test to 
test is the specific detail of the steps. For any given set of conditions in Step 1, 
it is the role of the mathematical statistician to discover optimum procedures 
for Steps 2 and 3. The procedures which we present in this book have been 
thus worked out by statistical theorists, and confirmed in usefulness by applied 
practice in diverse fields of investigation. 


As in the case of estimation, the most important test situation involving a 
population mean is the one in which the standard deviation σ is unknown also.
Here we need only apply the relation (5.7.2) in Step 2 and then proceed with a
t-test in the same way we used the z-test above.


Example 6.7.1 


There is concern about strontium-90 radioactivity in milk. Measurement of 
such radioactivity is in terms of the number of picocuries of radiation per liter. 


A certain standard sets 5 picocuries per liter as an acceptable limit for a 
milk-supply mean. A given milk supply is checked by use of a random sample 
of 10 units taken from the supply. The concern is whether the supply mean 
exceeds the 5 picocurie standard, and a 1 percent level of significance is used. 
In one test the sample gave the following results: 7.9, 9.1, 9.8, 8.4, 10.1, 7.6, 
8.2, 9.9, 10.2, and 11.0. We assume the radioactivity per unit to be normally 
distributed. 




Ho: μ = 5 versus Ha: μ > 5. Y is normal; random sample,
n = 10; α = .01.

If Ho is true, then

$$\frac{\bar{Y} - 5}{\sqrt{s^2/10}} = t \text{ with 9 d.f.}$$

Reject Ho if t ≥ 2.821; accept Ho otherwise.

[Figure: probability density function of t with 9 d.f., with the critical
region to the right of t = 2.821.]

      y        y²
     7.9     62.41
     9.1     82.81
     9.8     96.04
     8.4     70.56
    10.1    102.01
     7.6     57.76
     8.2     67.24
     9.9     98.01
    10.2    104.04
    11.0    121.00
    ----    ------
    92.2    861.88

$$\bar{y} = \frac{92.2}{10} = 9.22,$$

$$s^2 = \frac{861.88 - \frac{(92.2)^2}{10}}{9} = \frac{861.88 - \frac{8500.84}{10}}{9} = \frac{861.88 - 850.08}{9} = \frac{11.80}{9} = 1.31,$$

$$t = \frac{9.22 - 5}{\sqrt{1.31/10}} = \frac{4.22}{\sqrt{0.131}} = \frac{4.22}{0.362} = 11.66.$$

Reject Ho. The observed sample mean radioactivity count is significantly
greater than 5 picocuries per liter, at the 1 percent level of significance.

P < 0.0005.


The P value is exactly

P = P(t ≥ 11.66 | 9 d.f.).

In Table A-4 we see that 11.66 is beyond the largest tabulated entry for 9 d.f.
(4.781), and that largest entry is the 99.95th percentile. Thus
P(t ≥ 4.781 | 9 d.f.) = .0005, and so, since P(t ≥ 11.66 | 9 d.f.) is less than
P(t ≥ 4.781 | 9 d.f.), we report P < .0005.
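
The hand computation in Example 6.7.1 can be cross-checked with a short script. This is an added sketch using only the standard library; the critical value 2.821 is taken from the text rather than computed, and the variable names are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

# Strontium-90 readings (picocuries per liter) from Example 6.7.1.
y = [7.9, 9.1, 9.8, 8.4, 10.1, 7.6, 8.2, 9.9, 10.2, 11.0]
mu0 = 5.0          # mean specified by Ho
t_crit = 2.821     # 99th percentile of t with 9 d.f., as quoted in the text

y_bar = mean(y)
s2 = variance(y)                       # sample variance, divisor n - 1
t = (y_bar - mu0) / sqrt(s2 / len(y))
reject = t >= t_crit

print(round(y_bar, 2), round(s2, 2))   # 9.22 1.31
print(round(t, 2), reject)             # 11.66 True
```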


6.8 RELATION BETWEEN TESTING AND ESTIMATING 


Each of the above examples of test of hypothesis concerning a population mean
μ shows that we use the same statistic and the same probability distribution
theory as in making a confidence interval estimate of μ. This relationship is
true in general for estimating a parameter on the one hand and testing a
hypothesis concerning it on the other. The circumstance is of course not
surprising, since a given chance situation involves a single probability structure
and yields the same observations no matter what use we intend for the data.


The distinction between estimation and hypothesis testing is one of purpose. 
The test is a go-no-go gauge; its objective is a decision to reject or not to reject 
a stated null hypothesis. The confidence interval is a bracket for a parameter; 
its objective is a dependable statement of boundaries for the actual value of the 
parameter. High likelihood of being correct is provided in each procedure by 
use of the probability structure of the sample statistic, controlling at a low level 
(the level of significance) the probability of falsely rejecting a null hypothesis, 
controlling at a high level (the confidence coefficient) the probability of 
constructing a correct interval for the parameter value. 


Consider the situation in Example 6.6.1. There we had a test of Ho: μ = 12
against the alternative Ha: μ ≠ 12, and rejected Ho at the 5 percent level of
significance, on the basis of a sample mean ȳ = 11.948. An obvious question is:
If you believe μ is not 12, what value do you believe μ does have? The answer
can be given by a confidence interval. The limits of a 95 percent confidence
interval are


$$11.948 \pm 1.960\sqrt{\frac{.01}{25}} = 11.948 \pm 1.960\sqrt{.0004} = 11.948 \pm 1.960(.02) = 11.948 \pm 0.039,$$

so that a 95 percent confidence interval for μ is

$$11.909 < \mu < 11.987. \tag{6.8.1}$$


We can notice that this interval does not include the value 12. Hence μ = 12 is
outside the boundaries of our 95 percent confidence, and this is consistent with
our having rejected Ho: μ = 12 at the 5 percent level of significance.
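
The interval (6.8.1) and the check that it excludes 12 can be reproduced as follows. This is a minimal added sketch; the variable names are hypothetical.

```python
from math import sqrt

y_bar, sigma, n = 11.948, 0.1, 25
z_c = 1.960                          # confidence-limit value of z for 95 percent

half_width = z_c * sigma / sqrt(n)   # 1.960 * 0.02
lower, upper = y_bar - half_width, y_bar + half_width

print(round(lower, 3), round(upper, 3))   # 11.909 11.987
print(lower < 12 < upper)                 # False: 12 lies outside the interval
```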




Such a relationship is no mere coincidence. The confidence-limit value of z for
a 95 percent confidence interval is precisely the same (1.96) as the critical value
of z for a two-tailed test at the 5 percent level of significance. The decision rule in
the test was: reject Ho if z ≤ -1.960 or z ≥ +1.960; accept Ho otherwise. This
makes the acceptance region


-1.960 < z < +1.960,


and, in terms of ȳ and a hypothetical value μ₀ for the mean μ, that acceptance
region is

$$-1.960 < \frac{\bar{y} - \mu_0}{0.02} < 1.960,$$

$$-0.039 < \bar{y} - \mu_0 < 0.039,$$

$$\bar{y} - 0.039 < \mu_0 < \bar{y} + 0.039. \tag{6.8.2}$$


Thus any hypothetical value of μ outside the limits in (6.8.2) would be rejected
in a two-tailed test at the 5 percent level of significance. But the limits in
(6.8.2) are precisely the limits of a 95 percent confidence interval for μ, as we
saw in obtaining (6.8.1).


In general, a c percent confidence interval for a population parameter 
contains all of the values that could be accepted in a two-tailed test of 
hypothesis about that parameter at the (100 - c) percent level of significance.
There is a similar relationship between one-tailed tests and one-sided confi- 
dence intervals but these are not considered in this book. 
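
The equivalence between the two-tailed acceptance region and the confidence interval can be checked numerically over a grid of hypothetical means. This is an added sketch under the box-weight setup; the function names and the grid of candidate values are assumptions made for illustration.

```python
y_bar, se, z_c = 11.948, 0.02, 1.960

def accepted_by_test(mu0):
    """Two-tailed z-test at the 5 percent level: accept Ho when |z| < 1.960."""
    return abs((y_bar - mu0) / se) < z_c

def inside_interval(mu0):
    """Is mu0 within the 95 percent confidence limits around y_bar?"""
    return y_bar - z_c * se < mu0 < y_bar + z_c * se

# Candidate hypothetical means from 11.900 to 12.000 in steps of 0.005.
candidates = [round(11.90 + 0.005 * k, 3) for k in range(21)]
agree = all(accepted_by_test(m) == inside_interval(m) for m in candidates)

print(agree)                       # True
print(accepted_by_test(12.0))      # False
print(accepted_by_test(11.95))     # True
```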


If the investigator wants to pursue analysis beyond just a decision to reject or 
not reject a null hypothesis, he customarily starts with a confidence interval.
This is especially pertinent when a null hypothesis is rejected. Many times the 
test of hypothesis is a screening process to uncover “significant” cases for more 
detailed study. 


А confidence interval following a test of hypothesis is usually the two- 
boundary interval that we have studied, even when the test is one-tailed. In
such cases one must bear in mind that the confidence-limit value of z or t is not
the same as the critical value in the test. The following cases from our earlier
examples illustrate the point.


Example 6.8.1 

Our first example, introduced in Section 6.2 and carried along through the
various test steps, involved Ho: μ = 12 versus Ha: μ < 12. The test was
conducted at the 5 percent level of significance, with the critical region being
z ≤ -1.645. We observed ȳ = 11.948 and rejected Ho. A 95 percent confidence
interval to follow this test would be exactly the same one (6.8.1) that we
constructed after the two-tailed test using the same level of significance and the
same value of ȳ.




Example 6.8.2 


In Example 6.7.1 we had a t test of Ho: μ = 5 versus Ha: μ > 5. At the 1
percent level of significance the critical region was t ≥ 2.821, the critical value
being the 99th percentile of t with 9 d.f. For a 99 percent confidence interval
for μ we must proceed as specified by (5.7.3), using data in Example 6.7.1 to
obtain the limits

$$9.22 \pm (3.250)(0.362) = 9.22 \pm 1.18,$$

thus obtaining the 99 percent confidence interval

$$8.04 < \mu < 10.40.$$
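
The same limits follow from the raw data of Example 6.7.1. This is an added sketch; the value 3.250 is the confidence-limit value of t quoted in the text (not computed here), and the names are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

y = [7.9, 9.1, 9.8, 8.4, 10.1, 7.6, 8.2, 9.9, 10.2, 11.0]
t_c = 3.250                               # from the text, for 99 percent and 9 d.f.

y_bar = mean(y)
se = sqrt(variance(y) / len(y))           # estimated standard deviation of y-bar
lower, upper = y_bar - t_c * se, y_bar + t_c * se

print(round(se, 3))                       # 0.362
print(round(lower, 2), round(upper, 2))   # 8.04 10.4
```

Note that 3.250 differs from the one-tailed critical value 2.821 used in the test, which is the point of the example.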


EXERCISES 


[Consider it satisfactory to assume that Y is essentially normally distributed in all the 
following exercises.] 


6.8.1 In testing Ho: μ = 3.5 against Ha: μ > 3.5, in a normal population, a sample of
size 20 yields ȳ = 3.591 and s = 0.1527. What decision should be made at the 1
percent level of significance?


6.8.2 For what values of ȳ would you reject the null hypothesis in a two-sided test for
each of the following situations? (a) Ho: μ = 0.8, given σ = 0.2, n = 16, α = .05,
(b) Ho: μ = 0.8, s = 0.2, n = 16, α = .05, (c) Ho: μ = 10, σ = 2, n = 100, α = .10,
(d) Ho: μ = 10, s = 2, n = 100, α = .10, (e) Ho: μ = 40, σ = 5, n = 4, α = .01, and
(f) Ho: μ = 40, s = 5, n = 4, α = .01.


6.8.3 Let us assume that regular accounting procedures turn up numerical errors 
averaging about $21.40 per month. A new method of accounting involving some 
automation is under investigation. We would like to determine whether the cost 
of numerical errors will change. We desire to run a risk of 5 percent of being 
wrong when we say that there has been a change in the average dollar error per 
month. A sample of n = 20 observations using the automated system was taken
and the following calculations were made:


ȳ = $21.50; s = $0.40


a. Is there sufficient evidence to warrant taking action on a change in dollar
errors per month? State the hypothesis to be tested. Calculate the statistic
needed to test the hypothesis.

b. Suppose we are interested in making the change to the automated system
only if there is a decrease in the average dollar error per month. Write down
the hypothesis for this case. What is your decision for this case?




6.8.4 Each department in a metals-processing company inspects its finished product
with its own micrometer. These micrometers are calibrated using “4” standard
gauge blocks, for which the true measurement should be .025. It has been
suggested that these micrometers are all different. To test this hypothesis,
eight identical standard gauge blocks were submitted to each department in a
random order and were measured. Micrometer data from the first department were:


.0251 
.0252 
.0248 
.0252 
.0247 
.0254
.0245 
.0246 


a. If you use an α-risk of 5 percent, do you agree or disagree with the
suggestion that the department's micrometer needs calibrating? 
b. The second department's micrometer readings were as follows: 


.0251 
.0249 
.0254 
.0253 
.0249 
.0248 
.0248 
.0254 


Again using α = 5 percent, do you agree or disagree with the suggestion that this
department's micrometer needs calibration?

c. Using the confidence interval approach from Chapter 5, calculate 95 percent
confidence limits on the true mean reading of block size for each micrometer.
Can you say that the two true means are different?

d. Can you think of a better way to determine whether the two micrometers are
different from each other?


6.8.5 A good T.V. commercial has a percent recall score of 40. A new T.V.
commercial was given a trial in a southwestern U.S. city, and the following
percent recall scores were obtained: 42, 18, 46, 35, 51, 29, 45, 45, 36, and 35.

a. With α = 10 percent, use the t test to determine whether this T.V.
commercial's percent recall is significantly less than the standard of 40.

b. What conclusions can you draw about this result?

c. How would you conduct an experiment like this if you were trying to learn
how good the new T.V. commercial really is?




6.9 TESTS CONCERNING THE DIFFERENCE 
BETWEEN TWO POPULATION MEANS 


In Sections 5.8 and 5.9 we considered the confidence-interval estimation of
the difference μ₁ - μ₂ between two population means. In Section 5.8 we had the
situation in which the population standard deviations σ₁ and σ₂ are known, and
in Section 5.9 we treated the more practical situation in which σ₁ and σ₂ are
unknown. In this latter situation we restricted ourselves to the case wherein
σ₁ = σ₂. In the matter of hypothesis testing we shall limit ourselves to this
restricted practical case.


Example 6.9.1 


In Examples 6.7.1 and 6.8.2 we considered data on strontium-90 radioactiv- 
ity in the milk of a certain given supply. A milk supply in a different region was 
checked by means of a random sample of 12 units measured for strontium-90 
activity. The resulting measurements (in picocuries per liter) were 4.6, 5.1, 8.7, 
6.9, 6.1, 5.0, 5.2, 5.0, 5.6, 5.2, 3.9, and 4.0. Are the observed means in the two 
regions significantly different, at the 1 percent level of significance? 

Assuming the radioactivity per unit to be normally distributed in each 
region, and its standard deviation to be the same in both regions, we can apply 
the probability information of (5.9.2) and conduct the test of significance as 
follows: 


Ho: ш = p2 versus Ha: pai * p2. Yı and У; normal, о: = o»; 
independent random samples, 
п = 10, n2=12, а =.01. 


If Ho is true, then 
(Y:- Yj-0 


Tm 


Reject Ho if t<—2.845 or if t2 42.845; accept Ho otherwise. 


=t with 20 d.f. 


Probability density function of 
t with 20 d.f. 


—2.845 0 2.845 


6.9 THE DIFFERENCE BETWEEN TWO POPULATION MEANS m 223 


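From the sample data we have the following:


From Example 6.7.1, $n_1 = 10$, $\bar{y}_1 = 9.22$, and

$$s_1^2 = \frac{11.80}{9} = 1.31.$$

For the second sample:

     y        y^2
    4.6      21.16
    5.1      26.01
    8.7      75.69
    6.9      47.61
    6.1      37.21
    5.0      25.00
    5.2      27.04
    5.0      25.00
    5.6      31.36
    5.2      27.04
    3.9      15.21
    4.0      16.00
   65.3     374.33

$$\bar{y}_2 = \frac{65.3}{12} = 5.44,$$

$$s_2^2 = \frac{374.33 - \dfrac{(65.3)^2}{12}}{11} = \frac{374.33 - \dfrac{4264.09}{12}}{11} = \frac{374.33 - 355.34}{11} = \frac{18.99}{11} = 1.73.$$

The pooled variance and the test statistic are then

$$s_p^2 = \frac{11.80 + 18.99}{10 + 12 - 2} = \frac{30.79}{20} = 1.54,$$

$$t = \frac{(9.22 - 5.44) - 0}{\sqrt{\dfrac{1.54}{10} + \dfrac{1.54}{12}}} = \frac{3.78}{\sqrt{.154 + .128}} = \frac{3.78}{\sqrt{.282}} = \frac{3.78}{0.531} = 7.12.$$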


Reject $H_0$. The observed mean concentrations in the two regions differ
significantly, at the 1 percent level of significance.

Because the test is two-tailed, the descriptive level of significance P is

$$P = 2 \cdot P(t \ge 7.12 \mid 20 \text{ d.f.}).$$

In Table A-4 we see that 7.12 is beyond the largest tabulated entry for 20 d.f.
(3.850). Since that entry is the 99.95th percentile, the probability beyond it is
0.0005, and so $P(t \ge 7.12 \mid 20 \text{ d.f.})$ is less than 0.0005. Twice the probability is
thus less than double the value 0.0005, and therefore we report:

$$P < 0.001.$$


The limits of a 99 percent confidence interval for the difference in population
means $\mu_1 - \mu_2$ can be quickly set down here, since the numerical values needed
in (5.9.3) have already been found and used in the test of significance:

$$3.78 \pm 2.845(0.531) = 3.78 \pm 1.51.$$

Thus a 99 percent confidence interval for the difference in population mean
strontium-90 activity in milk (picocuries per liter) in the two regions is

$$2.27 < \mu_1 - \mu_2 < 5.29.$$
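The arithmetic of Example 6.9.1 can be retraced by machine. What follows is a minimal Python sketch and not part of the original text: the function names are our own, the first region's summary figures (mean 9.22, sum of squared deviations 11.80, n = 10) are taken as quoted from Example 6.7.1, and the critical value 2.845 is simply copied from Table A-4 rather than computed.

```python
import math

def sum_sq_dev(values):
    """Sum of squared deviations of the values from their own mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)

def pooled_t(mean1, ssd1, n1, mean2, ssd2, n2):
    """Two-sample t statistic under the assumption sigma_1 = sigma_2."""
    df = n1 + n2 - 2
    pooled_var = (ssd1 + ssd2) / df                      # pooled variance
    std_err = math.sqrt(pooled_var / n1 + pooled_var / n2)
    return (mean1 - mean2) / std_err, df, std_err

# Second region: the 12 measurements listed in Example 6.9.1.
region2 = [4.6, 5.1, 8.7, 6.9, 6.1, 5.0, 5.2, 5.0, 5.6, 5.2, 3.9, 4.0]
mean2 = sum(region2) / len(region2)

# First region: summary values quoted from Example 6.7.1 (assumed as given).
mean1, ssd1, n1 = 9.22, 11.80, 10

t, df, se = pooled_t(mean1, ssd1, n1, mean2, sum_sq_dev(region2), len(region2))

T_CRIT = 2.845  # Table A-4, 20 d.f., two-tailed alpha = .01 (copied, not computed)
reject = abs(t) >= T_CRIT

# 99 percent confidence limits for mu_1 - mu_2.
low = (mean1 - mean2) - T_CRIT * se
high = (mean1 - mean2) + T_CRIT * se
print(df, reject, round(low, 2), round(high, 2))
```

Because the sketch carries unrounded intermediate values, its t may differ in the second decimal place from the hand calculation, which rounds at each step; the decision to reject is unaffected.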




EXERCISES 


[Consider it satisfactory to assume that random variables are normally distributed and
$\sigma_1 = \sigma_2$ in each of the following exercises.]


6.9.1 Company A produces a special pharmaceutical tablet on a press. The tablet
press has two outlets. From each outlet a total of 10 tablets is randomly selected,
weighed, and recorded. Based on the results shown below, is there a statistically
significant difference between the two outlets? (Use $\alpha = 5$ percent.)


    Left Side          Right Side
 10.90   10.94       10.95   10.92
 10.94   10.94       10.92   10.93
 10.97   10.91       10.92   10.93
 10.95   10.95       10.90   10.94
 10.90   10.96       10.91   10.92


6.9.2 A new brand promotion method is to be tested in one sales area. The
effectiveness of this method will be judged by the weekly shipments of the brand
to this area, measured as a percentage of the 1959 base period. A second sales
area, in which the normal promotion method is continued, will be used as a
control. Because the different promotion methods probably involve different
sales cycles, the weekly data from the two districts cannot be paired. The
following data are collected over a 13-week period:


Week    Test District    Control District
  1         109               135
  2         108               101
  3         127               111
  4         136               119
  5         101               106
  6         111               117
  7         136               106
  8         109                91
  9         119                88
 10         126                92
 11          85               105
 12         145                98
 13         147                96


a. On the basis of the above data, we wish to determine whether the new
method is more effective than the old method. With an $\alpha$ risk of 0.05, could
you say that the new method is more effective?

[Note that $H_a$ is $\mu_1 > \mu_2$, that is, $\mu_1 - \mu_2 > 0$.]

b. What is the best estimate of the change in shipments brought about by the
new method? Give a 90 percent confidence interval for the true change.

6.9.3 All accounting information is available from two plants. From plant A, 10
random accounts were selected and checked for percent errors. From plant B,
eight accounts were chosen at random. The results are as follows:


    Plant A          Plant B
 12.1   12.4       13.6   12.5
 12.0   10.6       12.4   13.6
 12.1   11.8       11.8   12.8
 11.8   12.4       11.9   13.2
 11.5   11.9


Would you consider these plants different? Why? (Choose your own $\alpha$.)


6.9.4 Twenty new accounting men are to be trained in our accounting system. Two
methods of training have been suggested. In order to test these methods, the
trainees are divided into two groups; group 1 is trained by method A and group
2, by method B. After three months of training, a test set of material is given to
all trainees. The scores are shown below.


Training Method A    Training Method B
    96   92              94   96
    90   94              93   84
    93   83              86   98
    88   80              89   95
    86   98              94   91


a. Write out the hypothesis to be tested.
b. Using $\alpha = 10$ percent, would you decide that one training method is better
than the other?
6.9.5 A company is interested in the ability of denture cleansers to prevent the
accumulation of stain on dentures. Two brands were compared. Thirty subjects
with dentures were selected at random and fifteen were assigned at random to
each brand. All dentures were thoroughly cleaned by the dental technician at the
beginning of the test period. The quantity of stain was then measured at the end
of a test period of 3 months. The following results were obtained.


Stain Scores 
Brand A Brand B 

"- 75 5.0 65 
10.0 10.0 6.0 8.0 

X5 170 11.0 8.5 

ЕЕ 155 9.0 6.5 

8.0 9.5 РА 2 

3.0 10.0 5.0 8.0 
116 115 8.0 7.5 

8.5 60 


Using an $\alpha$-risk of 5 percent, what is your conclusion about the relative merits of
the two brands?




6.9.6 Two vendors have been asked to furnish random samples of "Supercleanser"
cans made with a board that will minimize the moisture-vapor transmission rate.
The following figures are in grams per 100 inches squared per 24 hours.


Supplier A    Supplier B
   .047          .054
   .047          .052
   .055          .052
   .053          .050
   .049          .051
   .047          .054
   .051
   .046


Code the data by multiplying by 1000 and subtracting 40.

a. Is there any difference in the means of cans supplied by the two suppliers?
[Use $\alpha = .05$.]

b. Calculate a 95 percent confidence interval for the true difference between the
means.

c. What is the relationship between the confidence interval and test of
significance in this example?


6.9.7 Industrial engineers working with chemical engineers are responsible for making 
changes in process methods and in process equipment in order to improve 
performance. In a typical sulfonation process, an industrial engineer suggested 
making the nozzle orifices smaller in order to increase the percent completeness 
of the finished product. The following data were obtained before and after the 
change: 


—————— 


Before Change       After Change

95.5 96.1 96.4 95.8 
95.2 95.8 95.6 96.6 
95.0 95.8 95.6 95.7 
94.9 95.7 96.5 95.6 
95.4 95.8 95.7 95.4 
95.3 95.8 95.7 95.3 
94.6 95.4 96.2 96.5 
95.8 95.1 95.8 96.2 
95.5 95.6 96.7 96.4 
95.8 96.6 


Using a t test with $\alpha = 1$ percent, do you agree that a significant increase in
percentage of completeness has occurred?




6.9.8 a. Using the following sample data, find a 95 percent confidence interval for the 
population mean: 
у Тәү, б, дуд 


b. Consider the following sample data taken from a second population: 
у: 5,6,3,2,3,4,4,5 


Test the hypothesis that the population mean in this case and that in the case 
considered in (a) are equal. (Use 1 percent level of significance.) 

c. Find a 99 percent confidence interval for the difference between the means of 
the two populations. 


6.9.9 We wish to determine whether women will use more of a certain dish-washing
liquid if the size of the cap on the container is increased. A random sample of 16
women was given a container with one cap size to use for a specified number of
dish washings. Another random sample of 16 women was given a container with
the other cap size to use for an equal number of dish washings. The containers
with the larger caps are marked "K" and the containers with small caps, "J."
The following total usage data were obtained:


Usage with J             Usage with K
(cubic centimeters)      (cubic centimeters)


55 58 
75 74 
45 47 
96 104 
38 38 
62 66 
55 57 
82 80 
75 79 
43 49 
54 57 
31 39 
44 47 
55 58 
53 51 
69 71 


a. Does the larger cap result in an increased usage? (Use an $\alpha$ risk of 0.05.)


b. How would you improve this testing procedure? 




6.10 TESTS CONCERNING THE BINOMIAL PROPORTION p 


We can carry forward the estimation discussion of Section 5.10 and apply
our standard hypothesis-testing procedure to obtain tests about p. The essential
probability theory is stated by the relation (5.10.3), so that, if $H_0$ hypothesizes
that p has a certain specific value, say $p_0$, then under $H_0$ we have

$$\frac{\hat{p} - p_0}{\sqrt{\dfrac{p_0 q_0}{n}}} \approx Z \quad \text{if } np_0 > 5 \text{ and } nq_0 > 5, \text{ where } q_0 = 1 - p_0.$$

From this we proceed as in our z tests concerning a mean $\mu$. (Note that the
difficulty we had in Section 5.10, where we had to use $\hat{p}$ in the standard error,
does not arise here, since $p_0$ is hypothesized and used wherever needed.)


Example 6.10.1 


A random sample of 150 voters drawn from a certain town is questioned for 
opinion about a town ordinance under consideration. Favorable opinion is 
given by 81. Is this proportion significantly less than 60 percent, at the 1 
percent level of significance?


$H_0$: $p = .60$ versus $H_a$: $p < .60$. p = binomial proportion;
random sample, $n = 150$; $\alpha = .01$.


If $H_0$ is true, then

$$\frac{\hat{p} - .60}{\sqrt{\dfrac{(.60)(.40)}{150}}} \approx Z.$$

Reject $H_0$ if $z \le -2.326$; accept $H_0$ otherwise.

$$\hat{p} = \frac{81}{150} = 0.54,$$

$$z = \frac{0.54 - 0.60}{\sqrt{\dfrac{(.60)(.40)}{150}}} = \frac{-0.06}{\sqrt{\dfrac{.2400}{150}}} = \frac{-0.06}{\sqrt{.0016}} = \frac{-0.06}{0.04} = -1.50.$$

Accept $H_0$. The observed sample proportion of voters in favor of the proposed
ordinance is not significantly less than 60 percent, at the 1 percent level of
significance.

$$P = P(Z \le -1.50) = 0.0668.$$




If in addition a confidence interval for p is desired, using Section 5.10, a 99 percent
confidence interval would have limits:

$$.54 \pm 2.58\sqrt{\frac{(.54)(.46)}{150}} = .54 \pm 2.58\sqrt{\frac{.2484}{150}} = .54 \pm 2.58\sqrt{.00166} = .54 \pm (2.58)(.0407) = .54 \pm .105,$$

giving the 99 percent confidence interval: 43.5 percent < p < 64.5 percent.
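As a check on Example 6.10.1, the same z test can be sketched in a few lines of Python. This is an illustration only and not part of the original text: the function name `proportion_z` is our own, and the normal tail area comes from the standard library's `statistics.NormalDist` instead of Table A-3.

```python
import math
from statistics import NormalDist

def proportion_z(successes, n, p0):
    """z statistic for H0: p = p0, using p0 (not p-hat) in the standard error."""
    q0 = 1 - p0
    if n * p0 <= 5 or n * q0 <= 5:
        raise ValueError("normal approximation not considered satisfactory")
    p_hat = successes / n
    return (p_hat - p0) / math.sqrt(p0 * q0 / n)

# Example 6.10.1: 81 favorable replies among 150 voters, H0: p = .60, Ha: p < .60.
z = proportion_z(81, 150, 0.60)
p_value = NormalDist().cdf(z)       # lower-tail area, since Ha is p < .60
reject = z <= -2.326                # one-tailed critical value at alpha = .01
print(round(z, 2), round(p_value, 4), reject)
```

The hypothesized value $p_0$ appears in the standard error, as the note in the text requires; only the confidence interval substitutes the sample proportion.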


Example 6.10.2 

Refer to the student data given in Chapter 1, in particular the opinion on 
whether marijuana should be legalized. Responses 1 ("strongly disagree") and 
2 (“mildly disagree") indicate opinion against legalization. At the 5 percent 
level of significance, test the hypothesis that the population from which the 
sample was drawn has 25 percent of its members unfavorable to legalizing 
marijuana, as against some other percentage.


$H_0$: $p = .25$ versus $H_a$: $p \neq .25$. p = binomial proportion;
random sample, $n = 180$; $\alpha = .05$.


If $H_0$ is true, then

$$\frac{\hat{p} - .25}{\sqrt{\dfrac{(.25)(.75)}{180}}} \approx Z.$$

Reject $H_0$ if $z \le -1.960$ or if $z \ge +1.960$; accept $H_0$ otherwise. From the
sample data, giving 63 replies in categories 1 or 2, we have

$$\hat{p} = \frac{63}{180} = 0.35,$$

$$z = \frac{0.35 - 0.25}{\sqrt{\dfrac{(.25)(.75)}{180}}} = \frac{0.10}{\sqrt{\dfrac{.1875}{180}}} = \frac{0.10}{\sqrt{.00104}} = \frac{0.10}{0.0322} = 3.11.$$

Reject $H_0$. The observed sample proportion of students unfavorable to
legalizing marijuana is significantly different from 25 percent, at the 5 percent
level of significance.

Using the closest z value (3.090) in Table A-3, we have

$$P = P(Z \ge 3.11 \text{ or } Z \le -3.11) \approx 2(.0010) = .0020.$$




6.11 TESTS CONCERNING THE DIFFERENCE 
BETWEEN TWO POPULATION PROPORTIONS 


In Section 5.12 we worked out a procedure for constructing an approximate
confidence interval for the difference $p_1 - p_2$ between two population
proportions. For this we used the relation (5.12.3) as a starting point and then
introduced further approximation by using estimates for $p_1$ and $p_2$ in the
standard error, thus taking the probability structure as

$$\frac{(\hat{p}_1 - \hat{p}_2) - (p_1 - p_2)}{\sqrt{\dfrac{\hat{p}_1\hat{q}_1}{n_1} + \dfrac{\hat{p}_2\hat{q}_2}{n_2}}} \approx Z \quad \text{if } n_1\hat{p}_1,\ n_1\hat{q}_1,\ n_2\hat{p}_2,\ n_2\hat{q}_2 \text{ are all greater than 5.} \tag{6.11.1}$$

In the customary test of “the significance of the difference between (sample)
proportions,” the null hypothesis $H_0$ specifies no difference; $H_0$: $p_1 = p_2$ or
$p_1 - p_2 = 0$. Under this hypothesis the estimates $\hat{p}_1$ and $\hat{p}_2$ are both estimates of
the same parameter value, say p, the common value of $p_1$ and $p_2$ under the
hypothesis that $p_1 = p_2$. In this circumstance $\hat{p}_1$ and $\hat{p}_2$ should be pooled to give
a single estimate of the single value p, just as we pooled $s_1^2$ and $s_2^2$ when we
assumed $\sigma_1^2 = \sigma_2^2 = \sigma^2$. Again we use a weighted average, this time using sample
size as weight. This leads to a particularly simple and sensible pooled estimate:

$$\hat{p} = \frac{n_1\hat{p}_1 + n_2\hat{p}_2}{n_1 + n_2} = \frac{\text{Total number of “heads” in both samples}}{\text{Total number of observations in both samples}} \tag{6.11.2}$$

This estimate replaces $\hat{p}_1$ and $\hat{p}_2$ in (6.11.1) when we are operating under the
hypothesis that $p_1 = p_2$. That hypothesis also causes $p_1 - p_2$ to be zero, and so
the modification of (6.11.1) gives us the following probability relation to use in
our test of hypothesis.

Under $H_0$: $p_1 = p_2$,

$$\frac{\hat{p}_1 - \hat{p}_2}{\sqrt{\dfrac{\hat{p}\hat{q}}{n_1} + \dfrac{\hat{p}\hat{q}}{n_2}}} \approx Z. \tag{6.11.3}$$

The approximation is considered satisfactory if $n_1\hat{p}$, $n_1\hat{q}$, $n_2\hat{p}$, $n_2\hat{q}$ are all
greater than 5.


Example 6.11.1 


Test the significance of the difference between males and females as to
unfavorable opinion on legalizing marijuana, according to the sample data in
Chapter 1. Use the 5 percent level of significance.


$H_0$: $p_F = p_M$ versus $H_a$: $p_F \neq p_M$. $p_F$ = binomial proportion, females;
$p_M$ = binomial proportion, males;
independent random samples,
$n_F = 76$, $n_M = 104$; $\alpha = .05$.


If $H_0$ is true, then

$$\frac{\hat{p}_F - \hat{p}_M}{\sqrt{\dfrac{\hat{p}\hat{q}}{76} + \dfrac{\hat{p}\hat{q}}{104}}} \approx Z.$$

Reject $H_0$ if $z \le -1.960$ or if $z \ge +1.960$; accept $H_0$ otherwise. From the
sample data we have

$$\hat{p}_F = \frac{32}{76} = 0.421, \qquad \hat{p}_M = \frac{31}{104} = 0.298, \qquad \hat{p} = \frac{63}{180} = 0.350,$$

$$z = \frac{0.421 - 0.298}{\sqrt{\dfrac{(.350)(.650)}{76} + \dfrac{(.350)(.650)}{104}}} = \frac{0.123}{\sqrt{\dfrac{.2275}{76} + \dfrac{.2275}{104}}} = \frac{0.123}{\sqrt{.00299 + .00219}} = \frac{0.123}{\sqrt{.00518}} = \frac{0.123}{0.0720} = 1.71.$$

Accept $H_0$. The observed difference between male and female students as to
sample proportion unfavorable to legalizing marijuana is not significant at the 5
percent level.

$$P = P(Z \ge 1.7 \text{ or } Z \le -1.7) = 2(.0446) = .0892.$$


If we want a confidence interval for the difference $p_F - p_M$ we must use
(5.12.4), keeping $\hat{p}_1$ and $\hat{p}_2$ distinct in the standard error rather than using
pooled $\hat{p}$ as in the above test. The test proceeds under a null hypothesis that
$p_F = p_M$; such a hypothesis is clearly not applicable when we seek bounds on a
nonzero difference. The limits of an approximate 95 percent confidence
interval for $p_F - p_M$ are

$$(0.421 - 0.298) \pm 1.96\sqrt{\frac{(.421)(.579)}{76} + \frac{(.298)(.702)}{104}}$$
$$= 0.123 \pm 1.96\sqrt{\frac{.2438}{76} + \frac{.2092}{104}}$$
$$= 0.123 \pm 1.96\sqrt{.00321 + .00201}$$
$$= 0.123 \pm 1.96\sqrt{.00522}$$
$$= 0.123 \pm (1.96)(.0722)$$
$$= 0.123 \pm 0.142,$$

so that an approximate 95 percent confidence interval is

$$-0.019 < p_F - p_M < +0.265$$

or

-1.9 percent < $p_F - p_M$ < +26.5 percent.




Notice that the confidence bounds extend from where the female proportion is 
1.9 percentage points lower than the male to where it is 26.5 percentage points 
higher. This is of course consistent with being unable to reject a null hypothesis 
of no difference, since the value pr—pm = 0 lies within the confidence interval. 
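The contrast between the pooled standard error used in the test and the unpooled standard error used in the confidence interval is easy to lose sight of, so here is a hedged Python sketch of both computations for Example 6.11.1. The function names are our own invention, and the normal percentiles are taken from `statistics.NormalDist` instead of Table A-3; nothing here is from the original text.

```python
import math
from statistics import NormalDist

def two_proportion_z(x1, n1, x2, n2):
    """z for H0: p1 = p2, with the pooled estimate in the standard error."""
    pooled = (x1 + x2) / (n1 + n2)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (x1 / n1 - x2 / n2) / std_err
    p_two_tailed = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_two_tailed

def difference_interval(x1, n1, x2, n2, confidence=0.95):
    """Approximate interval for p1 - p2, keeping the two estimates distinct."""
    p1, p2 = x1 / n1, x2 / n2
    std_err = math.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    z_crit = NormalDist().inv_cdf(0.5 + confidence / 2)
    diff = p1 - p2
    return diff - z_crit * std_err, diff + z_crit * std_err

# Example 6.11.1: 32 of 76 females and 31 of 104 males unfavorable.
z, p_value = two_proportion_z(32, 76, 31, 104)
low, high = difference_interval(32, 76, 31, 104)
print(round(z, 2), round(low, 3), round(high, 3))
```

Since the sketch does not round intermediate results or the z value before looking up the tail area, its P value will be slightly smaller than the tabled .0892, though the conclusion is the same.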


EXERCISES 


6.11.1 Suppose that for a certain disease the mean mortality is 36 out of 100
attacks. If under a new treatment there are 120 deaths out of 400 attacks, what
would you say, at the 1 percent level, about the new treatment?


6.11.2 The probability of winning a game of "craps" is 0.495. Suppose a player wins
60 games out of 100. At the 1 percent level, would you consider this a
significant deviation from the expected? What would be your decision at the 5
percent level of significance?


6.11.3 In a test of effectiveness, 200 insects were sprayed with insecticide A and 300
were sprayed with insecticide B. The numbers of insect deaths were 150 and
210, respectively. At the 5 percent level, is there a significant difference in
effectiveness between the two insecticides?


6.11.4 With the standard process of manufacturing a certain article, 5 percent of the
produced units are defective. A new time- and money-saving process will be
installed if it does not significantly increase this proportion of defectives. A test
run of 900 units produced by the new process shows 55 defective. At the 5
percent level of significance, what conclusion would you draw?


6.11.5 It is believed that 20 percent of the voters in a certain community are
Independent voters. A poll is taken of 196 voters constituting a representative
sample. Of these, 27 state that they are Independent. Does this result support
the belief at the 5 percent level of significance?


6.11.6 Referring to the mean proportion of deaths given in Exercise 4.9.2, consider a
group of 1000 persons of age 60 about whom there is a strong presumption of
unusually good health. If 200 of this group die before age 70, what conclusion
(at the 1 percent level of significance) could you draw?


6.11.7 A random sample of $n = 300$ housewives was given two products, A and B, to
use. One week later each housewife was asked which product she preferred.
Fifty-five percent preferred A. Is this sufficient evidence with $\alpha = 5$ percent to
conclude that A is truly preferred to B?


6.11.8 In a poll of 100 Independent voters in a local area, 40 percent stated that they
would vote for candidate A, 48 percent said that they would vote for candidate
B, and 12 percent said that they were undecided. Assuming the 12 percent will
vote 6 percent for A and 6 percent for B, is this enough evidence for you to say
B will win the election? Why or why not?




6.11.9 A test to determine whether a new deodorant (D) is preferred to a standard
well-known brand (S) was conducted in two different cities. In city A, 52
percent of the $n = 200$ people tested said that they preferred brand D, the new
brand. In city B, 55 percent of the $n = 200$ people tested said that they
preferred brand D.

a. Using $\alpha = 10$ percent and these data, is there sufficient evidence to state that
the preference in city B for product D is significantly higher than in city A?

b. Overall, how many people preferred product D? If we pool all of the data,
what proportion of people prefer product D?

c. Using the pooled data and $\alpha = 5$ percent, test the hypothesis that the
products S and D are equally preferred. What conclusion do you make?

d. Comment on the assumptions one makes by pooling these results from the
two different cities.


Sorting Out The Categories


7.1 INTRODUCTION


There are many situations when qualitative or “nominal” data are collected
and form the major evidence for or against certain conjectures or hypotheses.
One of the most useful techniques employed for analyzing these data is the
chi-square test of hypothesis. In this chapter, we will cover only one of the more
elementary topics that utilize the chi-square ($\chi^2$) distribution; namely, the
$r \times c$ contingency table. For those interested in other uses of chi-square tests, a
brief outline of other topics has been included.


7.2 A BINOMIAL PROBLEM


Let us consider another simple application of the binomial distribution. A coin
is tossed 100 times and the following results (Table 7.2.1) are obtained:


TABLE 7.2.1

Heads    Tails    Total
  60       40      100


As a result of this experiment, is there sufficient evidence to state that the coin
is biased, that is, more apt to give heads than tails, or vice-versa? This question
can be precisely stated in the form of a test of the hypothesis:

$H_0$: $p_H = p_T = .50$

versus the two-tailed alternative:

$H_a$: $p_H \neq .50$

[Cartoon: TELL IT LIKE IT IS. "I'm 56% in agreement, 31% in disagreement, and 13% undecided." By Ralph Dunagin. Courtesy of Field Newspaper Syndicate.]
Let us prespecify the probability of rejecting the hypothesis of a fair coin when
we should not do so at $\alpha = 5$ percent. The binomial-distribution approach to
this problem is to calculate the observed proportion of heads from the sample
($\hat{p}_H$) and use the normal approximation to the binomial (since $np_0 > 5$) to test
the hypothesis $H_0$. We calculate:

$$z = \frac{\hat{p}_H - p_0}{\sqrt{\dfrac{p_0 q_0}{n}}}$$

and compare this calculated z-value with the value of Z for the normal
distribution that cuts off 2.5 percent of the area in each tail of the distribution.
Using the table of the normal distribution (Table A-3), we find this to be
$z_c = 1.96$.

In this example, the observed proportion and the calculated z are as follows:

$$\hat{p}_H = \frac{60}{100} = .60, \qquad z = \frac{.60 - .50}{\sqrt{\dfrac{(.50)(.50)}{100}}} = \frac{.10}{.05} = 2.00.$$




Since the calculated value of $z = 2.00$ is greater than the critical value of
$z_c = 1.96$, the experimenter concludes that the coin is biased, rejects $H_0$, and
accepts $H_a$.


7.3 1x2 TABLES 


Another approach to the solution of this type of problem is through the
use of the more generally applicable $\chi^2$ distribution. The $\chi^2$ statistic is
defined as follows:

$$\chi_l^2 = \sum_{i=1}^{k} \frac{(o_i - e_i)^2}{e_i} \tag{7.3.1}$$

where $o_i$ = observed frequency in the ith category
      $e_i$ = expected (theoretical) frequency in the ith category
      $k$ = number of categories into which the data are divided
      $\chi_l^2$ = a random variable that has the $\chi^2$
                 distribution with $l$ degrees of freedom.


The probability pattern of the $\chi^2$ statistic was worked out by Karl Pearson
and R. A. Fisher in the early 1900s. As in the case of Student's t, there is a
different distribution for every different number of degrees of freedom. Tables
of these distributions are readily available (e.g., Table A-5). Figure 7.3.1
graphs a few illustrative cases.


Let us consider our binomial problem in terms of $\chi^2$. Under the hypothesis
that the coin is unbiased, the expected number of heads in 100 tosses of the
coin is $E(Y)$, where Y has the binomial distribution with $n = 100$ and
$p = p_0 = .50$. Thus the expected frequency in the first category (heads) is
100(.50) = 50, and then the following table (Table 7.3.1) can be constructed:


[FIGURE 7.3.1 $\chi^2$ distribution with $l$ degrees of freedom. Three panels, each with the upper 5 percent tail marked, at 3.841, 5.991, and 15.507 respectively.]


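TABLE 7.3.1

                    Heads         Tails        Total
o_i = Observed    o_1 = 60      o_2 = 40        100
e_i = Expected    e_1 = 50      e_2 = 50        100

Using these values, we calculate

$$\chi_1^2 = \sum_{i=1}^{2} \frac{(o_i - e_i)^2}{e_i} = \frac{(60 - 50)^2}{50} + \frac{(40 - 50)^2}{50} = \frac{(10)^2}{50} + \frac{(-10)^2}{50} = \frac{100}{50} + \frac{100}{50},$$

$$\chi_1^2 = 4.00.$$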


The number of degrees of freedom $l$ associated with the random variable $\chi_l^2$
in tests on categorized data depends on the number of parameters in the
probability model assumed to be the source of the data and the number of
parameters in that assumed model after the null hypothesis $H_0$ makes the
underlying general model more specific. Recall that a parameter in a
probability model is an arbitrary constant whose numerical specification helps to pin
down a specific member of a family of probability distributions. For example, $\mu$
and $\sigma$ are the parameters in the family of normal distributions, and n and p are
the parameters in the family of binomial distributions.


When $\chi_l^2$ is the statistic associated with an array of categorized data, $l$ is
defined as

$$l = \begin{pmatrix} \text{Number of independent} \\ \text{parameters in the assumed} \\ \text{underlying model} \end{pmatrix} - \begin{pmatrix} \text{number of independent} \\ \text{parameters in the assumed} \\ \text{underlying model as} \\ \text{modified by } H_0 \end{pmatrix} \tag{7.3.2}$$




In our coin example, the underlying model (for the first part of Formula
7.3.2) is the binomial distribution with $n = 100$ and general p. In that model the
parameters are p and q, subject to the restriction $p + q = 1$. Thus there is one
independent parameter in the assumed underlying model. For the second part
of Formula (7.3.2), the null hypothesis specifies that the distribution has
$n = 100$ and also $p = 0.5$; thus the number of parameters under $H_0$ that are left
unspecified in the assumed model is zero. Thus

$$l = 1 - 0 = 1,$$

and the $\chi^2$ statistic is $\chi_1^2$: $\chi^2$ with 1 d.f.


Referring to Figure 7.3.1 of the $\chi^2$ distribution with $l = 1$ degree of freedom,
using Table A-5 for specifics, we find that the critical value of $\chi_1^2$ (say $\chi_c^2$) is
3.841 for $\alpha = 5$ percent. Since the observed value $\chi_1^2 = 4.00 > \chi_c^2 = 3.841$, we
reject $H_0$ and accept $H_a$; that is, these observations provide sufficient evidence
to say that the coin is biased.


This $\chi^2$ procedure and its result ought to be completely equivalent to the test
we had earlier performed by use of Z. It happens that the equivalence is
mathematically precise because the probability pattern of $Z^2$ can be proved to
be exactly the pattern of $\chi_1^2$ ($\chi^2$ with 1 d.f.). Thus, the critical values match:

$$z_c = 1.960, \quad z_c^2 = (1.960)^2 = 3.842; \quad \chi_c^2 = 3.841$$

(more nearly exactly, $z_c = 1.959964$, $z_c^2 = 3.841459$), and our calculated
statistic values match:

$$z = 2.00; \quad \chi_1^2 = 4.00 = z^2.$$
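The agreement between the two approaches can be confirmed numerically. The short Python sketch below is our own illustration, not part of the original text; `chi_square` is a hypothetical helper implementing Formula (7.3.1), and the normal percentile is obtained from `statistics.NormalDist` instead of Table A-3.

```python
import math
from statistics import NormalDist

def chi_square(observed, expected):
    """Sum of (o - e)^2 / e over the categories, as in Formula (7.3.1)."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

n, heads, p0 = 100, 60, 0.50

# z test on the observed proportion of heads.
z = (heads / n - p0) / math.sqrt(p0 * (1 - p0) / n)

# Chi-square test on the same data arranged as a 1 x 2 table.
chi2 = chi_square([heads, n - heads], [n * p0, n * (1 - p0)])

# Squared two-tailed 5 percent critical value of Z.
z_crit_sq = NormalDist().inv_cdf(0.975) ** 2

print(round(z, 2), round(chi2, 2), round(z_crit_sq, 6))
```

Squaring z reproduces the chi-square statistic, and squaring the two-tailed critical value of Z reproduces the tabled 5 percent critical value for 1 d.f.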


7.4 1xc TABLE


The extension of the binomial or $1 \times 2$ table to the more general $1 \times c$ table is
straightforward. Let us illustrate this extension with the following problem:

A manufacturing plant runs 24 hours a day by utilizing three shifts. Each
shift has the same number of employees working. The medical doctor for the
plant informs the plant manager that he is concerned about the number of
minor accidents occurring in the plant. He states that he believes the major
problem to be poor safety practice on some of the shifts. After some discussion
he furnishes the following data on minor accidents by shift for the period July 1
through December 31, 1972 (Table 7.4.1).


TABLE 7.4.1

                     Shift 1   Shift 2   Shift 3   Total
Number of
minor accidents        25        30        41        96


If one hypothesizes that safety practices, susceptibility to minor accidents, 
and other conditions are the same in all shift operations, is there any evidence 
in these data to refute this hypothesis? 


On the basis of this hypothesis, the probability of a minor accident occurring 
on shift 1 = probability of a minor accident occurring on shift 2 = probability of 
a minor accident occurring on shift 3, and then the likelihood of a minor 
accident's being in a particular shift is the same for all three shifts. That is, we 
have hypotheses as follows: 


$$H_0: p_1 = p_2 = p_3 = \frac{1}{3}$$

versus
$H_a$: at least two of the subscripted probabilities are unequal.


Let us use $\alpha = 5$ percent for testing this hypothesis. Under $H_0$, the expected
frequencies are calculated and recorded in the following table:


                    Shift 1   Shift 2   Shift 3   Total
Observed = o_i         25        30        41       96
Expected = e_i*        32        32        32       96

*$e_i = p_{0i} \times n = (1/3) \times 96 = 32$, i = 1, 2, 3.


Before calculating $\chi_l^2$, let's determine the number of degrees of freedom $l$.
The assumed underlying model has the parameters $p_1$, $p_2$, and $p_3$, subject to the
restriction $p_1 + p_2 + p_3 = 1$ (a minor accident that occurs has to occur on one of
the three shifts). Thus the number of independent parameters in the model is 2.
Under the null hypothesis $H_0$, all $p_i$'s are equal, and hence have to be 1/3 each.
Hence the number of parameters in the assumed underlying model under this
specific $H_0$ is zero. And so $l = 2 - 0 = 2$, telling us that we are dealing with
$\chi_2^2$: $\chi^2$ with 2 d.f.


The critical value of $\chi^2$ that has 2 d.f. and cuts off the upper 5 percent of the
distribution is $\chi_c^2 = 5.991$.
Calculations now give:

$$\chi_2^2 = \frac{(25 - 32)^2}{32} + \frac{(30 - 32)^2}{32} + \frac{(41 - 32)^2}{32} = \frac{49}{32} + \frac{4}{32} + \frac{81}{32} = 4.19.$$

In conclusion, since $\chi_2^2 = 4.19 < \chi_c^2 = 5.991$, we cannot reject $H_0$. Thus we
say that using these 6 months' data, the three shifts are not significantly
different at the 5 percent level. More evidence will be needed if we stick to our
5 percent significance level position and persist in a search for distinctions
among the shifts.


This $1 \times c$ categorical procedure is very general: one can hypothesize
anything he has the urge to concerning $p_1, p_2, p_3, \ldots, p_c$. For example in the above
situation involving minor accidents on three different shifts, one could, if he
liked, hypothesize that a minor accident is equally likely to be on shift 1 or shift
2 but twice as likely to be on shift 3. Here we would be hypothesizing the $p_i$'s to
be $p_1$, $p_1$, and $2p_1$, and since these must add up to 1, we would have
$1 = p_1 + p_1 + 2p_1 = 4p_1$, giving $p_1 = 1/4$, so that our null hypothesis is

$H_0$: $p_1 = 1/4$, $p_2 = 1/4$, $p_3 = 1/2$
versus
$H_a$: not all of these equalities hold.


The test statistic $\chi^2$ will again be $\chi_2^2$: $\chi^2$ with 2 d.f. (the general $1 \times c$ case has
$c - 1$ d.f.), and the calculations would be as follows:


         Shift 1   Shift 2   Shift 3   Total
o_i         25        30        41       96
e_i         24        24        48       96

$$\chi_2^2 = \frac{(25 - 24)^2}{24} + \frac{(30 - 24)^2}{24} + \frac{(41 - 48)^2}{48} = \frac{1}{24} + \frac{36}{24} + \frac{49}{48} = 2.56.$$

Since $2.56 < \chi_c^2 = 5.991$, we have to refrain from rejecting $H_0$ and say that
the observed distribution of accidents among the three shifts is not significantly
different from that which is hypothesized.



(Note: Having nonrejected two different hypotheses, we can see an argument
for avoiding the use of “accept” as the alternative to “reject.” Accepting two
different hypotheses seems a bit much. But we must remember that a test of
hypothesis treats “reject $H_0$” as meaning “data are not probabilistically
consistent with $H_0$,” and “accept $H_0$” as meaning “data are not probabilistically
inconsistent with $H_0$.” There can be many hypotheses falling into either
category. A statistical test is a valid check of one hypothesis with one set of
data.


Greater precision, greater assurance of correct decision, and greater 
knowledge about the truth—all require more samples, larger samples, and 
continuing experimentation. That is completely consistent with what scientists 
have known from earliest times. The contribution of statistical analysis is an 
orderly procedure, with probability statements giving numerical measure to 
credibility.) 
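Because the $1 \times c$ procedure accepts any hypothesized set of probabilities, it lends itself to a single reusable routine. The following Python sketch is ours and not from the original text; `chi_square_1xc` is a hypothetical name, and the critical value 5.991 is copied from Table A-5 rather than computed. It runs both hypotheses of this section against the accident data.

```python
def chi_square_1xc(observed, null_probs):
    """Chi-square statistic and degrees of freedom for a fully specified H0."""
    if abs(sum(null_probs) - 1.0) > 1e-9:
        raise ValueError("hypothesized probabilities must add up to 1")
    n = sum(observed)
    expected = [n * p for p in null_probs]           # e_i = n * p_0i
    stat = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return stat, len(observed) - 1                   # c - 1 d.f.

accidents = [25, 30, 41]          # shifts 1, 2, 3
CHI2_CRIT_2DF = 5.991             # Table A-5, 2 d.f., alpha = .05 (copied)

equal_stat, df = chi_square_1xc(accidents, [1 / 3, 1 / 3, 1 / 3])
skewed_stat, _ = chi_square_1xc(accidents, [1 / 4, 1 / 4, 1 / 2])

for label, stat in (("equal shifts", equal_stat), ("shift 3 doubled", skewed_stat)):
    print(label, round(stat, 2), "reject" if stat > CHI2_CRIT_2DF else "do not reject")
```

Neither hypothesis is rejected, which is the situation the note above discusses: the data are not inconsistent with either one.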


7.5 2x2 CONTINGENCY TABLE 


The next generalization to be considered is the situation where an individual 
observation is characterized in two ways, each of which is a binomial kind of 
classification. Consider the following example: 


A sample of $n = 78$ housewives was chosen at random and each was asked
the following two questions: 


1. What brand of soap are you using in your bathroom at the present time? 
2. What product do you use for laundering your regular clothes? 


Based on a knowledge of the brands made by Procter and Gamble, data
from the housewives' responses were tabulated in Table 7.5.1.


TABLE 7.5.1

                                        Question 1 (Soap)
                                 Procter and Gamble    Other
                                       Brands          Brands    Totals
Question 2    Procter and Gamble
(Laundry        Brands                   42              15        57
Product)      Other Brands               10              11        21
              Totals                     52              26        78


The question of interest in this problem is whether or not the choice of a
bathroom-soap product is independent of the choice of a laundry product.


The null hypothesis Ho is a statement that the choices are independent (i.e., 
there is no association between soap choice and laundry-product choice). The 
rejection of this hypothesis would be of considerable importance to Procter and 
Gamble: it would indicate that the Procter and Gamble label in the area of 
soap products covering both bathroom bars and laundry soaps is important in 
maintaining market position. 


This hypothesis must be written in probability terms as follows: 


Let $p_{i\cdot}$ = probability of using laundry products in the ith category:
    $p_{1\cdot}$ = probability of using a Procter and Gamble brand of laundry
    product,
    $p_{2\cdot}$ = probability of using another brand of laundry product;
$p_{\cdot j}$ = probability of using bathroom bar soap in the jth category:
    $p_{\cdot 1}$ = probability of using a Procter and Gamble bar soap,
    $p_{\cdot 2}$ = probability of using another brand of bar soap;
$p_{ij}$ = probability of using both the ith category laundry product and
    the jth category bar soap product.
(Note the use of i to index row categories in the data table, j to index column
categories, i number first and j number second. Thus $p_{12}$ = probability of being
in the first-row category and second-column category. This is standard usage in
mathematics generally.)


Using our probability definition of independent events (Chapter 4, Definition
4.4.1), we can state our hypothesis in these terms:


$H_0$: $p_{ij} = p_{i\cdot}\,p_{\cdot j}$ for all i, j (i = 1, 2; j = 1, 2)
versus
$H_a$: at least one of the stated equalities does not hold.


We will use the а = 5 percent significance level for testing the hypothesis of 
independence in our example. Since all of these probabilities are unknown and 
we are not making hypotheses about their specific values, we will estimate 
them from the data as follows: 


First let us repeat the data table with some useful notation added (Table
7.5.1.a):




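TABLE 7.5.1.a

                                            Bar Soap
                               Procter and Gamble    Other
                               Brands                Brands             Totals

Laundry     Procter and Gamble
Products    Brands               $o_{11} = 42$       $o_{12} = 15$      $n_{1.} = 57$
            Other Brands         $o_{21} = 10$       $o_{22} = 11$      $n_{2.} = 21$

            Totals               $n_{.1} = 52$       $n_{.2} = 26$      $n = 78$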


Our best estimate of the probability of using a Procter and Gamble laundry
brand is:

    $\hat{p}_{1.} = \dfrac{n_{1.}}{n} = \dfrac{57}{78}.$

Our best estimate of the probability of using another laundry brand is:

    $\hat{p}_{2.} = \dfrac{n_{2.}}{n} = \dfrac{21}{78}.$

[Notice that $\hat{p}_{1.} + \hat{p}_{2.} = \dfrac{57}{78} + \dfrac{21}{78} = 1$.]

Our best estimate of the probability of using a Procter and Gamble bathroom-
soap bar is:

    $\hat{p}_{.1} = \dfrac{n_{.1}}{n} = \dfrac{52}{78}.$

Our best estimate of the probability of using another bathroom-soap bar is:

    $\hat{p}_{.2} = \dfrac{n_{.2}}{n} = \dfrac{26}{78}.$

[Again $\hat{p}_{.1} + \hat{p}_{.2} = 1$.]

Based on the null hypothesis $H_0$: $p_{ij} = p_{i.} \, p_{.j}$, we argue $\hat{p}_{ij} = \hat{p}_{i.} \, \hat{p}_{.j}$ and then
calculate the expected number of observations in each of the cells of the table in
the following way:

    $e_{ij} = n \times \hat{p}_{i.} \times \hat{p}_{.j}$,

where $e_{ij}$ = expected number of observations in the (i, j)th cell,
      $n$ = total sample size.




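Thus we obtain:

    $e_{11} = 78 \times \hat{p}_{1.} \times \hat{p}_{.1} = 78 \times \dfrac{57}{78} \times \dfrac{52}{78} = \dfrac{(57)(52)}{78} = \dfrac{2964}{78} = 38,$

    $e_{12} = 78 \times \hat{p}_{1.} \times \hat{p}_{.2} = 78 \times \dfrac{57}{78} \times \dfrac{26}{78} = \dfrac{(57)(26)}{78} = \dfrac{1482}{78} = 19,$

    $e_{21} = 78 \times \hat{p}_{2.} \times \hat{p}_{.1} = 78 \times \dfrac{21}{78} \times \dfrac{52}{78} = \dfrac{(21)(52)}{78} = \dfrac{1092}{78} = 14,$

    $e_{22} = 78 \times \hat{p}_{2.} \times \hat{p}_{.2} = 78 \times \dfrac{21}{78} \times \dfrac{26}{78} = \dfrac{(21)(26)}{78} = \dfrac{546}{78} = 7.$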
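
The four expected numbers can be checked mechanically. The following is only a sketch in Python; the function name `expected_counts` is our own choice for illustration, and the inputs are the marginal totals used in the calculations above (57 and 21 for laundry products, 52 and 26 for bar soap).

```python
def expected_counts(row_totals, col_totals):
    """Expected cell counts under independence:
    row total times column total divided by grand total."""
    n = sum(row_totals)
    if n != sum(col_totals):
        raise ValueError("row and column totals must give the same grand total")
    return [[r * c / n for c in col_totals] for r in row_totals]

row_totals = [57, 21]   # laundry product: Procter and Gamble, other
col_totals = [52, 26]   # bar soap: Procter and Gamble, other

e = expected_counts(row_totals, col_totals)
for row in e:
    print(row)
```

Each row of expected numbers adds up to its row total, and each column to its column total, just as the observed counts do.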


You may have noticed that each of the above calculations boiled down to
row total multiplied by column total divided by grand total. This is no mere
coincidence. In every r×c contingency table the expected numbers are given
by:

    $e_{ij} = \dfrac{n_{i.} \times n_{.j}}{n}.$                                   (7.5.1)

Placing our $e_{ij}$ in the same table with the data, we now have Table 7.5.1.b:


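TABLE 7.5.1.b

                                            Bar Soap
                               Procter and Gamble    Other
                               Brands                Brands             Totals

Laundry     Procter and Gamble   $o_{11} = 42$       $o_{12} = 15$      $n_{1.} = 57$
Products    Brands               $e_{11} = 38$       $e_{12} = 19$

            Other Brands         $o_{21} = 10$       $o_{22} = 11$      $n_{2.} = 21$
                                 $e_{21} = 14$       $e_{22} = 7$

            Totals               $n_{.1} = 52$       $n_{.2} = 26$      $n = 78$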



To determine the number of degrees of freedom $l$, we count the number of
independent parameters in the underlying model and subtract the number of
independent parameters left after the null hypothesis is applied. In the model
we have:

    $p_{11}, p_{12}, p_{21}, p_{22}$, subject to totaling 1.


Another way of saying this is that every housewife in the study must fall into
one and only one of the four cells. Thus if we write the table in probability
notation we would have:

                                 Bar Soap
                          Column 1     Column 2     Totals

    Laundry    Row 1      $p_{11}$     $p_{12}$     $p_{1.}$
    Products   Row 2      $p_{21}$     $p_{22}$     $p_{2.}$

               Totals     $p_{.1}$     $p_{.2}$     1

Thus in the underlying assumed model there are four parameters, $p_{11}$, $p_{12}$, $p_{21}$,
and $p_{22}$, but they must add up to 1. Thus there are but three independent
parameters in the underlying model.

Under the null hypothesis of independence, $H_0$ further specifies that $p_{ij} =
p_{i.} \times p_{.j}$. As you can see from the above table, $p_{1.} + p_{2.} = 1$ and $p_{.1} + p_{.2} = 1$. Thus
of these four, namely $p_{1.}$, $p_{2.}$, $p_{.1}$, $p_{.2}$, only two are independent. Then using rule
7.3.2, $l = 3 - 2 = 1$ d.f. Our test statistic is thus $X^2$ with 1 d.f.: $\chi^2_1$. The critical
value of $\chi^2_1$ when $\alpha = 5$ percent is 3.841 (Table A-5).
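
For readers who like to see where a tabled value comes from, the 1 d.f. case can be checked without a table. The sketch below relies on one standard fact not developed in the text: a $\chi^2$ variable with 1 d.f. is the square of a standard normal variable, so its upper critical value is the square of the two-sided normal cutoff. The function name is ours, and the approach does not extend to more than 1 d.f.

```python
from statistics import NormalDist

def chi2_critical_1df(alpha):
    """Upper-alpha critical value of chi-square with 1 d.f.,
    obtained as the square of the two-sided standard normal cutoff."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    return z * z

crit_05 = chi2_critical_1df(0.05)   # cutoff for a 5 percent test
crit_01 = chi2_critical_1df(0.01)   # cutoff for a 1 percent test
print(round(crit_05, 3), round(crit_01, 3))
```

The 5 percent value agrees with the 3.841 read from Table A-5.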

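Using the $o_{ij}$ data and the calculated $e_{ij}$ values in Table 7.5.1.b, we can now
calculate $\chi^2_1$.

    $\chi^2_1 = \sum_{\text{all four cells}} \dfrac{(o_{ij} - e_{ij})^2}{e_{ij}}$

    $= \dfrac{(42 - 38)^2}{38} + \dfrac{(15 - 19)^2}{19} + \dfrac{(10 - 14)^2}{14} + \dfrac{(11 - 7)^2}{7}$

    $= \dfrac{16}{38} + \dfrac{16}{19} + \dfrac{16}{14} + \dfrac{16}{7}$

    $= 4.69.$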


In conclusion, since the calculated value of $\chi^2_1 = 4.69$ exceeds the critical value
of 3.841, we reject $H_0$ and conclude that there is sufficient evidence in
these data to say that the choice of bar soap is not independent of the choice of
laundry products among the women whom these panel women represent. We
say that the observed association between selection of bar soap and selection of
laundry products is significant at the 5 percent level.
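
The whole 2×2 test can be written out in a few lines of Python. This is a sketch, not a prescribed procedure: the function name `chi_square` is ours, the second-row counts 10 and 11 are obtained by subtracting the first-row counts from the column totals 52 and 26, and the critical value 3.841 is simply copied from Table A-5.

```python
def chi_square(observed, expected):
    """Sum of (o - e)^2 / e over all cells of the table."""
    return sum((o - e) ** 2 / e
               for o_row, e_row in zip(observed, expected)
               for o, e in zip(o_row, e_row))

observed = [[42, 15],    # Procter and Gamble laundry brands
            [10, 11]]    # other laundry brands (52 - 42 and 26 - 15)
expected = [[38, 19],
            [14, 7]]

statistic = chi_square(observed, expected)
critical = 3.841                       # chi-square, 1 d.f., 5 percent level
reject_h0 = statistic > critical
print(round(statistic, 2), reject_h0)
```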


7.6 THE r×c CONTINGENCY TABLE


The next generalization to be considered is the case where an individual 
observation is characterized in two ways, each with more than two levels. We 
think of r categories for the row characteristic and c categories for the column 
characteristic. For example, a family could be classified by the number of 
children in the family and by the family income. An example of this is shown in 
Table 7.6.1. 


This type of table is called an r×c contingency table, in this case a 3×4
table, that is, three rows and four columns, giving a total of r×c = 12
categories.

(The student should note that in this example the columns represent discrete 
data, while the rows represent an arbitrary division of a continuous variable, 
income, into three discrete units. This latter procedure is a common one used 
in many fields, and while it can be useful, any class-intervalizing of data like 
this reduces the information content of the data. We must weigh the magnitude 
of this information loss before doing the grouping as illustrated in Table 7.6.1.) 


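TABLE 7.6.1

                                          Number of Children
Family
Income            j = 1              j = 2              j = 3              j = 4            Totals

Under $2000     $o_{11} = 15$      $o_{12} = 27$      $o_{13} = 50$      $o_{14} = 43$        135
                $e_{11} = 25.2$    $e_{12} = 40.4$    $e_{13} = 37.3$    $e_{14} = 32.1$

$2000-5000      $o_{21} = 25$      $o_{22} = 37$      $o_{23} = 12$      $o_{24} = 8$          82
                $e_{21} = 15.3$    $e_{22} = 24.6$    $e_{23} = 22.7$    $e_{24} = 19.4$

Over $5000      $o_{31} = 8$       $o_{32} = 13$      $o_{33} = 9$       $o_{34} = 10$         40
                $e_{31} = 7.5$     $e_{32} = 12.0$    $e_{33} = 11.0$    $e_{34} = 9.5$

Totals               48                 77                 71                 61           n = 257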


The data (the $o_{ij}$'s) in Table 7.6.1 were obtained by choosing n = 257 families
at random, determining family income and family size for each and filling in the
table accordingly. Thus the only marginal total fixed in advance was the grand
total, n = 257.

In this case, the hypothesis of independence is stated:

    $H_0$: Family size and family income are independent of each other.




In terms of probability, let

    $p_{i.}$ = probability of the occurrence of the ith family income,
    $p_{.j}$ = probability of the occurrence of the jth family size,
    $p_{ij}$ = probability of the simultaneous occurrence of the
             ith family income and the jth family size.

Then our hypothesis is specifically the following:

    $H_0$: $p_{ij} = p_{i.} \times p_{.j}$ for all i, j (i = 1, 2, 3; j = 1, 2, 3, 4),
versus
    $H_a$: at least one of the stated equalities does not hold.


Since we have n families, the expected number of families falling in the (i, j)th
category or cell of the r×c table is $n p_{ij} = n \times p_{i.} \times p_{.j}$ if the null hypothesis $H_0$ is
true.


Since these probabilities are unknown, estimates of them must be obtained
from the data:

    $\hat{p}_{i.} = \dfrac{\text{Number of families falling in the ith income class}}{\text{Total number of families in the sample}} = \dfrac{n_{i.}}{n},$

    $\hat{p}_{.j} = \dfrac{\text{Number of families falling in the jth family size class}}{\text{Total number of families in the sample}} = \dfrac{n_{.j}}{n}.$

The expected number of families having the ith income level and the jth
family size is then $n \hat{p}_{ij} = n \hat{p}_{i.} \hat{p}_{.j}$, giving

    $e_{ij} = \dfrac{n_{i.} \times n_{.j}}{n}$                                   (7.6.1)

as in (7.5.1). Based on the data given in Table 7.6.1, the expected numbers $e_{ij}$
have been calculated and shown in the same table.


To determine the number of degrees of freedom for $X^2$, we count independent
parameters as before. In the underlying model we have 11:

    $p_{11}, p_{12}, p_{13}, p_{14}, p_{21}, p_{22}, p_{23}, p_{24}, p_{31}, p_{32}, p_{33}, p_{34}$, subject to totaling 1.

According to what $H_0$ specifies in the model, the parameters are:

    $p_{1.}, p_{2.}, p_{3.}$, subject to totaling 1;

    $p_{.1}, p_{.2}, p_{.3}, p_{.4}$, subject to totaling 1.




These show that the independent parameters number 2 + 3 = 5. Thus $l =
11 - 5 = 6$, and so our $\chi^2$ is $\chi^2_6$ with 6 d.f.


The argument is easily generalized to any r×c table associated with the null
hypothesis of independence. With r rows and c columns, there are rc cells
(categories) altogether, and hence rc $p_{ij}$'s, subject to the restriction that they add
up to 1. Thus, the underlying model has (rc − 1) independent parameters.

According to the null hypothesis $H_0$, the parameters are as follows:

    $p_{1.}, p_{2.}, \ldots, p_{r.}$, subject to totaling 1 (r − 1 independent parameters);
    $p_{.1}, p_{.2}, \ldots, p_{.c}$, subject to totaling 1 (c − 1 independent parameters).

Hence

    $l = (rc - 1) - [(r - 1) + (c - 1)].$

A bit of algebraic persistence here brings out a beautiful result:

    $l = rc - 1 - (r + c - 2)$
    $\;\; = rc - c - r + 1$
    $\;\; = c(r - 1) - 1(r - 1)$
    $\;\; = (r - 1)(c - 1),$

and so we have the easy general formula:

    $l = (r - 1)(c - 1).$                                   (7.6.2)
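
A quick numerical check of this bit of algebra is easy to run. The sketch below (function names are ours) counts degrees of freedom the long way, model parameters minus parameters left under the null hypothesis, and compares the result with the product formula for a range of table sizes.

```python
def df_by_counting(r, c):
    """Independent parameters in the model minus those left under H0."""
    in_model = r * c - 1               # rc cell probabilities totaling 1
    under_h0 = (r - 1) + (c - 1)       # row and column marginals, each totaling 1
    return in_model - under_h0

def df_by_formula(r, c):
    """The product formula (r - 1)(c - 1)."""
    return (r - 1) * (c - 1)

# Compare the two counts for every table size from 2x2 up to 9x9.
agree = all(df_by_counting(r, c) == df_by_formula(r, c)
            for r in range(2, 10) for c in range(2, 10))
print(agree, df_by_formula(3, 4))
```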


Notice that our example, having a 3×4 table, gives $l = (3 - 1)(4 - 1) = 2 \cdot 3 =
6$, as we found with harder direct work earlier. Notice also that $l = (r - 1)(c - 1)$
is precisely the number of $e_{ij}$'s which you can individually compute before the
marginal totals force the final cell in each row and each column to come out
just right for the sum.

Returning to the test of our example, let us take the level of significance
$\alpha = .01$. Then the critical value for the .01 level of significance is 16.812.

The value of the observed $\chi^2$ can now be obtained for testing the hypothesis
of the independence of family size and income.

    $\chi^2_6 = \sum_{\text{all 12 cells}} \dfrac{(o_{ij} - e_{ij})^2}{e_{ij}}$



A tabulated worksheet will help organize the calculations and speed up the 
computation by putting the arithmetic on a production-line basis. 


Worksheet

      o        e        o − e      (o − e)²     (o − e)²/e

     15      25.2      −10.2       104.04         4.129
     27      40.4      −13.4       179.56         4.445
     50      37.3       12.7       161.29         4.324
     43      32.1       10.9       118.81         3.701
     25      15.3        9.7        94.09         6.150
     37      24.6       12.4       153.76         6.250
     12      22.7      −10.7       114.49         5.044
      8      19.4      −11.4       129.96         6.699
      8       7.5        0.5         0.25          .033
     13      12.0        1.0         1.00          .083
      9      11.0       −2.0         4.00          .364
     10       9.5        0.5         0.25          .026

                                      Total       41.248


Since $\chi^2_6 = 41.248$ exceeds the critical value 16.812, we reject $H_0$ and conclude
that the observed association between family size and family income is
significant at the 1 percent level of significance.
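
The production-line arithmetic of the worksheet is the sort of thing a computer does well. The sketch below takes the twelve observed counts from the worksheet, arranged as three income rows by four family-size columns, and derives everything else. Note one assumption that differs from the hand calculation: the expected numbers here are carried at full precision rather than rounded to one decimal, so the statistic comes out slightly different from 41.248, though the conclusion is unchanged.

```python
observed = [
    [15, 27, 50, 43],   # income under $2000
    [25, 37, 12,  8],   # income $2000-5000
    [ 8, 13,  9, 10],   # income over $5000
]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
n = sum(row_totals)

# Expected numbers: row total times column total divided by grand total.
expected = [[r * c / n for c in col_totals] for r in row_totals]

chi2 = sum((o - e) ** 2 / e
           for o_row, e_row in zip(observed, expected)
           for o, e in zip(o_row, e_row))
df = (len(row_totals) - 1) * (len(col_totals) - 1)

critical = 16.812       # chi-square, 6 d.f., 1 percent level (from the text)
print(n, df, round(chi2, 2), chi2 > critical)
```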


7.7 OTHER USEFUL $\chi^2$ TESTS


There are many other uses for the $\chi^2$ distribution. For example, $\chi^2$ is used
for testing the fit of data to a completely theoretical probability distribution,
the goodness of fit test. It is used to combine the probability levels of several
tests of significance. For our purposes, we shall discuss two other interesting
uses of the $\chi^2$ statistic: the test of homogeneity and the test for a shift in
binomial proportion.


A. Test of Homogeneity (2×2 case)


A test panel of 300 women were asked to try two products, A and B. Fifty 
percent of the women tried A first and B second, while the other 50 percent tried 
B first and A second. The data were recorded in Table 7.7.1. 


TABLE 7.7.1

                     Order       Order
                     A → B       B → A       Totals

    Preferred A       105          60          165
    Preferred B        45          90          135
    Totals            150         150          300 = n




Note that one of the margins has fixed totals. (Column totals were fixed by
assigning 150 women to Order A → B and 150 women to Order B → A.)
When this is true, the hypothesis is framed in the sense of homogeneity. The
hypothesis to be tested in this case is: the preference between A and B is the
same for those women who tried the products in the order A → B as for those
who tried the products in the order B → A.


The procedure of analysis turns out to be the same as the one used to
test the independence hypothesis in the 2×2 table discussed previously,
although the logic is different.


Here we are checking whether the row-wise category breakdown in column
1 is significantly different from that in column 2. (The query is whether the
columns are homogeneous as to row breakdown; thus the name of the test.) If
the null hypothesis is true, the probability breakdown by row is the same for
both columns. Thus we would estimate an overall probability of being in row 1
(preferred A), and since the overall number of women who preferred A was
165, the estimate would be

    $\hat{p}_{1.} = \dfrac{165}{300}.$

Similarly the overall probability of being in row 2 (preferred B) would be
estimated by

    $\hat{p}_{2.} = \dfrac{135}{300}.$


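Then we would expect the 150 women in column 1 to be distributed as:

    $e_{11} = 150 \times \dfrac{165}{300} = \dfrac{150 \times 165}{300} = 82.5, \qquad e_{21} = 150 \times \dfrac{135}{300} = \dfrac{150 \times 135}{300} = 67.5,$

and the 150 women in column 2 to be distributed as:

    $e_{12} = 150 \times \dfrac{165}{300} = \dfrac{150 \times 165}{300} = 82.5, \qquad e_{22} = 150 \times \dfrac{135}{300} = \dfrac{150 \times 135}{300} = 67.5.$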


Note that each $e_{ij}$ comes out to be in accord with our earlier rule (7.6.1): row
total times column total divided by grand total.

What about degrees of freedom? The underlying model has the following
parameters:

    Order A → B: $p_{11}, p_{21}$, subject to totaling 1;

    Order B → A: $p_{12}, p_{22}$, subject to totaling 1.



Thus the model has 1 + 1 = 2 independent parameters. After the null
hypothesis $H_0$ is applied, arguing $p_{11} = p_{12}$ and $p_{21} = p_{22}$, the parameters are:

    $p_{1.}, p_{2.}$, subject to totaling 1.

Hence there is now just one independent parameter, and so the rule (7.3.2) for
degrees of freedom gives

    $l = 2 - 1 = 1.$

We note that this is the same as in the 2×2 table testing independence. We
wind up the test in the usual manner:

    $\chi^2_1 = \dfrac{(105 - 82.5)^2}{82.5} + \dfrac{(45 - 67.5)^2}{67.5} + \dfrac{(60 - 82.5)^2}{82.5} + \dfrac{(90 - 67.5)^2}{67.5}$

    $= 6.136 + 7.500 + 6.136 + 7.500$

    $= 27.27.$


Since the calculated value of $\chi^2_1 = 27.27$ is greater than the critical value of
6.635 for $\alpha = 1$ percent, we conclude that the order of trying the products
definitely affects the preference for the products.
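
The homogeneity logic (pool the groups to estimate one common breakdown, then apply it to each group's fixed size) can be sketched in Python as follows. The dictionary layout and variable names are our own; the counts are those of the product-order panel, and 6.635 is the 1 percent critical value quoted above.

```python
# For each order group: [number preferring A, number preferring B].
groups = {
    "A then B": [105, 45],
    "B then A": [60, 90],
}

n = sum(sum(counts) for counts in groups.values())
# Pooled totals per preference category, across both groups.
category_totals = [sum(col) for col in zip(*groups.values())]

chi2 = 0.0
for counts in groups.values():
    size = sum(counts)                      # fixed in advance for each group
    for observed, total in zip(counts, category_totals):
        expected = size * total / n         # group size times pooled proportion
        chi2 += (observed - expected) ** 2 / expected

critical = 6.635                            # chi-square, 1 d.f., 1 percent level
print(category_totals, round(chi2, 2), chi2 > critical)
```

Although the reasoning differs from the independence case, the expected numbers again reduce to row total times column total divided by grand total.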


B. Test of Homogeneity (r×c case)


Let us extend the 2×2 case to a more general r×c situation. Here we
consider the question: Is the distribution of a certain characteristic the same for
several populations? In the previous 2×2 case the question was phrased: “Is
the distribution of preference for products A and B in the population of 
women who try them in the order A then B the same as in the population of 
women who try them in the order B then A?" 


There are many examples of this type of experimental data: a market 
research company conducts a four-product test in each of five different cities 
and records the number of people preferring each product; a political pollster 
selects four different voting districts and gives the sampled people a choice 
among four different candidates for the presidency in 1976; a pharmaceutical 
firm tries out three new products for the relief of headaches in people of four 
different economic levels, each person trying all three products and stating 
which product he thought best, and so on. 


Example 7.7.1 


During World War II a survey of men drafted into the Army was 
conducted on their attitudes toward the draft. It was thought that the 
amount of prior education before induction into the army would have a 
strong influence on attitude toward the draft. A random sample of пл. = 200 
men with less than a ninth-grade education (i.e., having just an elementary 
education) was drawn, and also random samples of n; = 100 men with a 
high-school diploma, and n;—50 men with at least two years of college 


77 OTHER USEFUL x° TESTS ш 253 
education. The data collected are shown as follows 


                                          How Do You Feel
                                          about the Draft?

                                      Don't Think     Think It
    Education                         It Fair         Fair         Totals

    Elementary                           142             58          200
    High School                           44             56          100
    At Least Two Years College             8             42           50

    Totals                               194            156          350


Is there sufficient evidence in these data to say that the distribution of the
attitudes toward the draft differs among the three “level of education”
populations?

The null hypothesis is that the distribution is the same for all populations. In
this example we have taken the populations as given by the rows in the table.
Under $H_0$,

    $p_{.j}$ = probability that an individual from any
             row population falls in the jth column.

We estimate $p_{.j}$ by $\hat{p}_{.j} = n_{.j}/n$, and then our estimate of the expected number
in the (i, j)th cell of the table is

    $e_{ij} = n_{i.} \, \hat{p}_{.j} = n_{i.} \cdot \dfrac{n_{.j}}{n} = \dfrac{n_{i.} \, n_{.j}}{n}.$

Notice that this number is mechanically identical to the $e_{ij}$ obtained in the
situation of the $\chi^2$ test of independence: row total times column total divided
by grand total.


Let us look at the degrees of freedom generally in an r×c case like the
present one. There are r rows, each representing a particular population. Each
row is broken down into c column categories. The parameters in the underlying
model are:

    Row 1: $p_{11}, p_{12}, \ldots, p_{1c}$, subject to totaling 1,
    Row 2: $p_{21}, p_{22}, \ldots, p_{2c}$, subject to totaling 1,
    ...
    Row r: $p_{r1}, p_{r2}, \ldots, p_{rc}$, subject to totaling 1.




Thus each row contributes (c − 1) independent parameters, and, since there are
r rows, there are r(c − 1) independent parameters altogether. According to the
null hypothesis $H_0$, the probability distribution is the same in every row, and so
under $H_0$ the parameters are:

    $p_{.1}, p_{.2}, \ldots, p_{.c}$, subject to totaling 1.

Hence there are now (c − 1) independent parameters, and thus the rule (7.3.2)
gives the number of degrees of freedom as:

    $l = r(c - 1) - (c - 1) = (c - 1)(r - 1),$

the same as (7.6.2). If we switch rows and columns in the argument, taking
each column to represent a population, and the rows to show category
breakdown (we had this setup in the 2×2 example of women with product
preference), then we get

    $l = c(r - 1) - (r - 1) = (r - 1)(c - 1),$

again the same result as before.


Returning to our data on attitude toward the draft, we calculate the number
of degrees of freedom as

    $l = (r - 1)(c - 1) = (3 - 1)(2 - 1) = 2 \cdot 1 = 2,$

and then proceed with the test:

    $H_0$: Distribution of draft attitudes is the same for all
           three educational levels.

If $H_0$ is true, then

    $\sum_{\text{all six cells}} \dfrac{(o - e)^2}{e} = \chi^2_2$ with 2 d.f.

Taking $\alpha = 5$ percent, we set up the decision rule: reject $H_0$ if the calculated
$\chi^2_2$ exceeds the critical value 5.991; do not reject $H_0$ otherwise.


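    Education          Draft Attitude         o        e       (o − e)     (o − e)²     (o − e)²/e

    Elementary         Don't think it fair   142    110.86      31.14      969.6996       8.747
                       Think it fair          58     89.14     −31.14      969.6996      10.878
    High School        Don't think it fair    44     55.43     −11.43      130.6449       2.357
                       Think it fair          56     44.57      11.43      130.6449       2.931
    At Least 2         Don't think it fair     8     27.71     −19.71      388.4841      14.020
    Years of College   Think it fair          42     22.29      19.71      388.4841      17.429

                                                                              Total      56.362

Thus, we have $\chi^2_2 = 56.362$.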


The conclusion is to reject $H_0$. The proportion of men who think the draft was
fair is different for the different “educational-level” populations. 


C. Test of Shift in Binomial Proportion 


A very interesting application of the $\chi^2$ test is the test of whether or not
there has been a shift in a binomial proportion. A typical example is the 
following: One hundred women in a panel test were asked for their preference 
between two brands of coffee, A and B. For the 2 months following the test, 
the area from which these women came was bombarded with an advertising 
campaign on the excellent qualities of brand A coffee. At the end of the 2 
months the same 100 women were brought back and given another blind 
preference test. The results of the two tests were combined and shown in the
following table:

[Table: product preferred in first test, by product preferred in second test]

This table looks like an ordinary 2×2 contingency table but there is a very
special peculiarity about it. It is addressed to the query as to whether opinion
before the advertising campaign is homogeneous with opinion after. But it
cannot allow us a test of homogeneity because we have only one single sample,
used both before and after, rather than a sample from the before population
and an independent sample from the after population. The table can give us a
$\chi^2$ test of independence (one sample categorized in two ways), but here that is
of no real interest: we do not need a statistical test to determine that the
opinions of a set of people at one time are associated with their opinions at
another time.




We do want to test the null hypothesis that the probability of a person's
preference for brand A before the advertising campaign is the
same as the probability of a person's preference for brand A after the
campaign. But with our experimental setup, this means:

    $H_0$: $p_{1.} = p_{.1}$,

and that is not the sort of null hypothesis handled by the standard $\chi^2$ tests on
contingency tables.


Theorists have worked out an acceptable test procedure, which has the
added advantage of intuitive appeal since it is based on assessing the switch
votes. If there were no change in the overall group preference between brands
A and B, we would expect all of those whose preferences switched from brand
A to B to be counterbalanced by those whose preferences switched from brand
B to A. Thus we consider only the 30 people whose preferences switched.
Under $H_0$, we would expect these to be split 50/50 between the two kinds of
switching, and so take $e_{12} = 15$, $e_{21} = 15$. The resulting $\chi^2$ has 1 d.f.


    $\chi^2_1 = \dfrac{(9 - 15)^2}{15} + \dfrac{(21 - 15)^2}{15} = \dfrac{36}{15} + \dfrac{36}{15} = 4.80.$

This result is significant at the 5 percent level since $\chi^2_1 = 4.80$ is greater than the
critical value 3.841. Thus at that level of significance we reject $H_0$ and
state that the observed change in preference has been significant. This is a good
indication, then, that the advertising campaign was indeed successful.
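
Since only the switchers enter the calculation, the test is short enough to state as a single function. The sketch below is ours (the function name and argument names are illustrative); it takes the two switch counts, expects a 50/50 split under the null hypothesis, and returns the 1 d.f. statistic.

```python
def switch_chi_square(count_one_way, count_other_way):
    """Chi-square (1 d.f.) for a shift in proportion, using only the people
    who switched; under H0 the switchers split evenly both ways."""
    switchers = count_one_way + count_other_way
    expected = switchers / 2
    return ((count_one_way - expected) ** 2
            + (count_other_way - expected) ** 2) / expected

statistic = switch_chi_square(9, 21)    # the 30 switchers in the coffee panel
critical = 3.841                        # chi-square, 1 d.f., 5 percent level
print(round(statistic, 2), statistic > critical)
```

The people who gave the same answer both times never enter the function, which is exactly the point of the procedure.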




7.8 EXERCISES 


7.8.1 It has been conjectured that the “serious” injury that occurs in football 
competition occurs in the latter part of any specific game. Some data were 
collected and are shown in the following table: 


      Time Since Start        Number of
      of Game (minutes)       Serious Injuries

      0-21                         57
      22-42                        88


Do you agree that there is a difference between the proportions of injuries in
the two halves? Use $\alpha = .05$.


7.8.2 Suppose the table in Exercise 7.8.1 were as follows:


      Time Since Start        Number of
      of Game (minutes)       Serious Injuries

      0-21                         70
      22-42                        75


At the same level of significance, what would be your conclusion now? If the
two tables were based on national experience for the last 2 years, what would
be your attitude toward the conjecture stated in Exercise 7.8.1?


7.8.3 A group of individuals, selected at random, are asked a question concerning
acceptability of one of a company’s products. After an extensive advertising
campaign, these same people are asked the question again. The data are
recorded as follows:


                            Before Campaign
      After Campaign      Yes      No      Total

      Yes                  42      28        70
      No                   20      30        50
      Total                62      58       120


a. Can these data be analyzed as an ordinary contingency table? If so, what
hypothesis would be tested?
b. Test the hypothesis of no change in opinion. Use $\alpha = .05$.


7.8.4 In a survey of 300 people in one city, 144 people preferred brand A soap to all
others; and in a sample of 600 people in another city, 312 people preferred the
same product A. Is this sufficient evidence to reject the hypothesis of equal
preference for brand A in both cities? Analyze this using the: (a) binomial and
(b) $\chi^2$ distribution. [Use $\alpha = .05$.]


7.8.5 Evidence has demonstrated that among working women, 56 percent of those
who stand while working have varicose veins. In a sample of 400 women who
sit or walk while at work, 120 of them were found to have varicose veins. With
an $\alpha$ risk of 5 percent, is this sufficient evidence to indicate that working
conditions may affect the occurrence of varicose veins?


7.8.6 In repeated surveys in the United States, 70 percent of high-school students
answer “a lot” to the question “How much does it matter to a person like
yourself whether you're healthy or not?” In a special survey among 40 ghetto
high-school students, 80 percent stated that their health meant “a lot” to them.
Is this sufficient evidence at a risk of $\alpha = 5$ percent to indicate that ghetto
children are more concerned about their health than the average respondent?


7.8.7 Injuries classified as “serious” in a particular article [Kraus, J. F., and Gullen,
W. H., An epidemiological investigation of predictor variables associated with
intramural touch football injuries, Am. J. Public Health 59 (12), 2144 (1969)],
were conjectured to have occurred more frequently in the later stages of a
game. Table 10 in that article shows the following data:*


      Time Since Start        Number of
      of Game (minutes)       Injuries

      0-7                          23
      8-14                         17
      15-21                        17
      22-28                        24
      29-35                        34
      36-42                        30
      Total                       145


Using an $\alpha$ risk of 5 percent, do you agree with this conjecture?


7.8.8 In Example 6.10.2 we tested the hypothesis that 25 percent of the student
population from which we drew the Mecca Community College sample are
unfavorable to legalizing marijuana, against the alternative that the percentage
is something other than 25 percent. Test the same hypothesis against the same
alternative, with the same level of significance (5 percent), using the $\chi^2$ test.


7.8.9 Three hundred and eighty-eight secondary students turned in a
self-administered health questionnaire. Physicians’ ratings of the same 388 students
were used as a criterion to validate their responses. The total number of “yes”
answers in the questionnaire was recorded; each “yes” indicates the presence
of a specific health problem. The data were recorded in the following table:


                                        Physicians' Classification
      Number of
      Questionnaire                          Problem        Problem
      “Yes” Responses       No Problem       No Priority    With Priority    Total


Using a $\chi^2$ test with $\alpha = 5$ percent, determine whether the students’ ideas of
their health status are significantly related to the physicians’ rating of their
health status.


*Copyright © 1969 by the American Public Health Association, Inc. Reprinted by permission
of the author and publisher.


7.8.10 The data from a study of the incidence of pneumoconiosis among
Appalachian bituminous coal miners show a relation between history of dust
on the lung and degree of pneumoconiosis. Among those working miners in the
study the following table was constructed.


                                Degree of Pneumoconiosis
      Dust on Lung        None     Suspect     Simple     Complicated     Totals

      Positive history     106        14         39           38            197
      No history          2031       121        133           38           2323

      Totals              2137       135        172           76           2520


Determine whether there is significant association between pneumoconiosis
and dust on the lung. What happens if you collapse the pneumoconiosis scale to
the same type of scale as dust on lung? Discuss how you would suggest doing
this. Using your reduced table, is there significant association between dust on
lung and pneumoconiosis? (Use $\alpha = 5$ percent.)


7.8.11 Another table in the study described above shows a relation between
roentgenographic findings and principal occupation. The following extracted
data will be used:


                                Degree of Pneumoconiosis
      Occupation            None     Suspect     Definite     Total

      Working miner         2143       134         248         2525
      Nonworking miner       872        79         213         1164

      Totals                3015       213         461         3689


                                Degree of Pneumoconiosis
      Working Miner         None     Suspect     Definite     Total

      Surface                375         8          10          393
      Underground           1768       126         238         2132

      Totals                2143       134         248         2525


a. Using $\alpha = 5$ percent, determine whether the degree of pneumoconiosis is
significantly different for working and nonworking miners.

b. Perform the same sort of test for the surface and underground working
miner.

c. Draw conclusions about working and nonworking miners and the degree of
pneumoconiosis.


Predicting With Confidence


8.1 INTRODUCTION 


In this chapter we shall discuss a different kind of estimation problem.
Suppose we were asked to estimate how much money a typical family of four
saved each year. In thinking about this problem, it would occur to us that the
amount of money saved by the family would certainly depend on how much
money the family earned. If we knew the family earnings we could probably
predict the savings much better. Let’s consider our Mecca Community College
data: one might conjecture that a student’s G.P.A. might be affected by the
time the student takes in commuting to school.

Thus these kinds of problems require the knowledge of two pieces of data for
every observation. In the family savings example, for each family we need the
amount of earnings, call it x, and the amount of savings, call it y. In the Mecca
College example, for each student we need the commuting distance, x, and the
G.P.A., y.

In general, then, we consider a slightly more complex situation. The random
variable, Y, is thought to be better explained by considering its relationship to
another measure, call it X. Our procedure will be to build a model relating Y
to X (sometimes called a fixed variable). In this chapter we shall deal only with
the simplest relationship between Y and X, namely a straight line. We shall
proceed slowly in building this relationship or model. Our method is to use a
real example with some real data.




8.2 AN EXAMPLE 


We all know that the older an automobile is the more it costs to keep it in 
good operating condition. It has been hypothesized that the maintenance cost 
of an automobile rises at a steady rate through the first 3 years of operation. In 
order to ascertain the correctness of this hypothesis, the following data were 
collected from six car owners on the cost of maintaining their automobiles: 


      Age of Car          Maintenance Cost
      (in months)         ($ per 6 months)
           x                     y

           6                    50
          12                    75
          18                   100
          24                   175
          30                   200
          36                   300


8.3 FITTING AN EQUATION TO THE DATA 


We shall proceed step by step toward building a good predictive model, that
is, an equation or formula for predicting how much the semiannual maintenance
cost will be at any age of the car.


STEP 1. First let's ignore how old the cars are, and just determine some
characteristics of the data we have on cars. In other words, consider that we
have a random sample of n = 6 semiannual car maintenance costs. Let's use
what we've learned to date and characterize the sample data by calculating the
mean, $\bar{y}$.

    $\bar{y} = \dfrac{\sum y_i}{n} = \dfrac{\$900}{6} = \$150$

Thus our first predictive model for the semiannual maintenance cost of
automobiles varying in age from 6 to 36 months is $150.


In modeling terms, we have adopted a very simple model for which the
observed data are: $y_i = \mu + \epsilon_i$, i = 1, 2, ..., 6, the $\epsilon_i$ values being random
deviations around the mean $\mu$.




We estimate $\mu$, the population mean semiannual cost, with $\bar{y}$, and our first
prediction model is

    $\hat{y} = \bar{y}$                                   (8.3.1)

that is,

    $\hat{y} = \$150$

where the circumflex (^) or “hat” indicates that y is to be predicted by $\hat{y}$.


STEP 2. How good is this first predictive model (8.3.1)? Once we've established
this simple model, we should determine how good a fit to reality it gives.
In other words, how much do the actual observed costs differ from $\hat{y}_i$ if every
$\hat{y}_i$ is taken as $150, in accordance with this model? And what is the overall
measure of the discrepancy?


One easy way to decide this is to calculate how much of the crude variation
in y (variation of y from zero) is explained by the model (8.3.1). We do this by
drawing vertical lines from the x axis to the points $(x_i, y_i)$, squaring the distances
given by these lines, and tallying these squared distances (Figure 8.3.1).


FIGURE 8.3.1 (vertical axis y; horizontal axis x = 6, 12, 18, 24, 30, 36)


The result of the operation on the observed data points is:

    $\sum y_i^2 = (50)^2 + (75)^2 + (100)^2 + (175)^2 + (200)^2 + (300)^2 = 178{,}750.$


Thus we say that the amount of crude variation in costs for our data set is 
measured as 178,750. 


If we replace these individual costs by saying “the average semiannual
maintenance cost for these cars is $150,” then we're using the predictive model
$\hat{y}_i = 150$ for all i. Graphically, we're replacing Figure 8.3.1 with Figure 8.3.2.




FIGURE 8.3.2 


The amount of crude variation explained by this predictive model ($\hat{y}_i = 150$)
is:

    $(150)^2 + (150)^2 + (150)^2 + (150)^2 + (150)^2 + (150)^2 = 6(150)^2 = n(\bar{y})^2 = 135{,}000.$

In general, this kind of sum ($n\bar{y}^2$) is called the sum of squares due to the mean.
We can summarize our modeling status as follows:

a. Using the data, we've established a predictive model, $\hat{y} = 150$, the mean
of the observations.

b. Of the total crude variation of 178,750, this model explains 135,000.
Thus the model explains

    $\dfrac{135{,}000}{178{,}750} \times 100 = 75.5$ percent

of the crude variation in our data.

c. There remain 178,750 − 135,000 = 43,750 squared units to explain.




Graphically, this can be shown as follows (Figure 8.3.3):

FIGURE 8.3.3 (vertical axis y; horizontal axis x = 6, 12, 18, 24, 30, 36)


d. The deviations of the observations $y_i$ from the first predictive model
$\hat{y}_i = \bar{y} = 150$ behave as follows:

              (1)       (2)           (3)                  (4)                      (5)
             $x_i$     $y_i$    $\hat{y}_i = \bar{y}$    $y_i - \hat{y}_i$    $(y_i - \hat{y}_i)^2$

               6        50           150                 -100                   10,000
              12        75           150                  -75                    5,625
              18       100           150                  -50                    2,500
              24       175           150                  +25                      625
              30       200           150                  +50                    2,500
              36       300           150                 +150                   22,500

    Totals   126       900           900                    0                   43,750
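
The bookkeeping of Steps 1 and 2 can be reproduced in a few lines of Python. This is only a sketch with variable names of our own choosing; the six costs are the ones in the data table of Section 8.2.

```python
costs = [50, 75, 100, 175, 200, 300]    # semiannual maintenance cost, dollars

n = len(costs)
mean_cost = sum(costs) / n                             # first predictive model
crude_ss = sum(y ** 2 for y in costs)                  # variation of y from zero
ss_due_to_mean = n * mean_cost ** 2                    # part explained by the mean
deviations = [y - mean_cost for y in costs]            # column (4) of the table
residual_ss = sum(d ** 2 for d in deviations)          # total of column (5)
percent_explained = 100 * ss_due_to_mean / crude_ss

print(mean_cost, crude_ss, ss_due_to_mean, residual_ss)
print(round(percent_explained, 1))
```

The three sums of squares obey the accounting identity used in item c: the crude sum of squares equals the sum of squares due to the mean plus the sum of squared deviations from the mean.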


STEP 3. Now, we notice that there is a pattern to the remaining unexplained
deviations (column 4). They are closely related to the values of x in
column 1. As x increases, the deviations increase steadily from -100 to
+150. Since \(\hat{y} = 150\) is a constant, the pattern we see in column 4 exists
also in the original data. As a matter of fact, a practical look at Figure
8.3.1 suggests that a straight line would fit the data pretty well.


We recall that the equation of a straight line has the form
\[ y = \beta_0 + \beta x, \tag{8.3.2} \]

where \(\beta_0\) and \(\beta\) are numerical constants which identify the specific line
shown in Figure 8.3.4, where \(\beta_0\) is the value for y when x = 0.




From Figure 8.3.4 we can see why \(\beta_0\) is called the y intercept of the line: \(\beta_0\)
is the vertical distance from the x axis at which the line crosses (intercepts) the
y axis. The constant \(\beta\) is called the slope of the line since it gives the rise (or
fall) in y per unit change in x.


In our case we have two problems. In the first place, we don't know what
values of \(\beta_0\) and \(\beta\) are being used by Nature in the process we are studying,
and in the second place, Nature uses that line only as an average and deals us
observations at variable locations off the line. So we have to use our observed
data and estimate the line; that is, estimate \(\beta\) and \(\beta_0\).

FIGURE 8.3.4

Let's plot the data (Figure 8.3.5).

We've already decided to use \(\hat{y} = \bar{y}\) as a first step. Thus let's make sure that
the new line goes through the point \((\bar{x}, \bar{y})\). In our data this point is (21 months,
$150). It is shown in Figure 8.3.5 as an open square with a dot in the center.

There are many possible straight lines that can be drawn through the point
\((\bar{x} = 21, \bar{y} = 150)\). We would like to choose the line which comes "closest" to all
the points.

FIGURE 8.3.5 (y axis marked 50 to 300 in steps of 50; x axis marked 6, 12, 18, 24, 30, 36)




EXERCISES 


8.3.1 Draw a line by eye that fits the car maintenance data as best you can. Make sure 
it goes through the point \((\bar{x} = 21, \bar{y} = 150)\). Determine the slope of the line
you've drawn using any two points \((x_1, y_1)\) and \((x_2, y_2)\) by the formula:

\[ \text{slope} = \frac{y_2 - y_1}{x_2 - x_1} \]

Then determine the distance of each observed data point from the line you've 
drawn. Square each of these distances and add them up. How close did you 
come to zero? This would represent a line that goes through every data point. 


8.3.2 The following data were collected by a chemist on the yield of a production plant 
at varying levels of process temperature. 


x y 
(in 10s of degrees Fahrenheit) (yield in 1000s of pounds) 


7 
8 
9 
10 
11 


оолло ~ 


a. Putting temperature on the horizontal axis and yield on the vertical axis, 
plot the five (x, y) points. 

b. Calculate \(\bar{x}\) and \(\bar{y}\). Draw by eye a line through \((\bar{x}, \bar{y})\) to fit the data.
c. Do you think temperature is a pretty good predictor of yield in this 
process? 


8.3.3 Another chemist in the same plant mentioned in Exercise 8.3.2 gathered data on 
pressure and yield. His data were as follows: 


x y 
(10s of millimeters of mercury) (yield in 1000s of pounds) 


7 3 
8 2 
9 1 
0 4 
1 5 




a. Putting pressure on the horizontal axis and yield on the vertical axis, plot
the five (x, y) points.

b. Calculate \(\bar{x}\) and \(\bar{y}\).

c. By eye, draw a best-fitting line to the data through \((\bar{x}, \bar{y})\).

d. Do you think this line is as good as that in Exercise 8.3.2? 

e. Which is a better linear (straight line) predictor of yield—temperature or 
pressure? 


The line you drew in Exercise 8.3.1 is undoubtedly very close to the best that 
can be done. Your process of “eyeballing,” however, does not allow scientific 
measurement of just how good it is. In fitting straight lines to data, there is a 
procedure that allows such measurement, and moreover gets the prize for "best
fit." We say best fit here in the sense of minimizing the sum of squared
distances that exist between the data points and the fitted line. In our example
on automobile-maintenance costs, that sum of squared distances, remember,
was 43,750 [\(\sum (y_i - \hat{y}_i)^2\) in column 5 of subparagraph (d) above] when we fitted
just the mean line \(\hat{y} = 150\) to the data points in Figure 8.3.3. The "line of best
fit" will give a much smaller sum of squared deviations.


The complete official name of the line to which we are referring is the line of 
best fit according to the principle of least squares. The expression “least squares" 
is shorthand for “smallest possible sum of squared vertical deviations between 
the data points and the line.” Long a common tool of widespread use among 
scientists of all kinds, the line is customarily referred to more briefly as the 
least-squares line or the least-squares line of best fit. 


Working out the mathematical mechanics to produce a formula for the 
least-squares line is an exercise in algebra or calculus which need not concern 
us here. The result is easy to state and use. The least-squares line goes through
the point of means \((\bar{x}, \bar{y})\) and has slope given by the formula

\[ b = \frac{\sum (x - \bar{x})(y - \bar{y})}{\sum (x - \bar{x})^2}. \]




Like the sum of squares for y that we use in calculating the sample variance
\(s^2\), the above formula can be put into other forms somewhat more convenient
for computing. The entire recipe for the least-squares line can be stated as
follows:

Least-squares line
\[ \hat{y} = \bar{y} + b(x - \bar{x}), \]
where
\[ b = \frac{\sum (x - \bar{x})(y - \bar{y})}{\sum (x - \bar{x})^2} = \frac{\sum xy - n\bar{x}\bar{y}}{\sum x^2 - n\bar{x}^2}. \tag{8.3.3} \]

After the equation for the line has been written in accordance with (8.3.3), it 
can be put in the form of (8.3.2) by multiplying out and collecting terms: 


\[ \hat{y} = (\bar{y} - b\bar{x}) + bx \]
Here we see b as our estimate of \(\beta\), and \((\bar{y} - b\bar{x})\) as our estimate of \(\beta_0\).


Let us apply this procedure to our data on automobile-maintenance costs 
(Table 8.3.1). 


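TABLE 8.3.1

          \(x_i\)    \(x_i - \bar{x}\)    \(y_i\)    \(y_i - \bar{y}\)    \((x_i - \bar{x})(y_i - \bar{y})\)    \((x_i - \bar{x})^2\)
            6        -15                   50        -100                 1500                                  225
           12         -9                   75         -75                  675                                   81
           18         -3                  100         -50                  150                                    9
           24         +3                  175         +25                   75                                    9
           30         +9                  200         +50                  450                                   81
           36        +15                  300        +150                 2250                                  225
Totals    126          0                  900           0                 5100                                  630
Means      21                             150

\[ b = \frac{5100}{630} = 8.10 \]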


This means that the maintenance cost increases approximately $8.10 each 
month during the first 3 years (36 months) of the automobile's life. 

Calculations making use of the alternative computing form of b are as
follows (Table 8.3.2). [We carry along \(y^2\) calculations because they will be
needed sooner or later.]




TABLE 8.3.2

            x        y       \(x^2\)     \(y^2\)      xy
            6       50          36        2,500        300
           12       75         144        5,625        900
           18      100         324       10,000      1,800
           24      175         576       30,625      4,200
           30      200         900       40,000      6,000
           36      300       1,296       90,000     10,800
Totals    126      900       3,276      178,750     24,000

\[ \sum (y - \bar{y})^2 = \sum y^2 - n\bar{y}^2 = 178{,}750 - 6(150.0)^2 = 178{,}750 - 135{,}000 = 43{,}750 \]
\[ \sum (x - \bar{x})^2 = \sum x^2 - n\bar{x}^2 = 3276 - 6(21.0)^2 = 3276 - 2646 = 630 \]
\[ \sum (x - \bar{x})(y - \bar{y}) = \sum xy - n\bar{x}\bar{y} = 24{,}000 - 6(21)(150) = 24{,}000 - 18{,}900 = 5100 \]

The equation of the least-squares line is
\[ \hat{y} = 150 + 8.10(x - 21) \]
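As a cross-check on Tables 8.3.1 and 8.3.2, the slope can be computed both ways in Python. This is a minimal sketch using only the six data pairs of the example; the variable names are our own.

```python
# Age in months (x) and semiannual maintenance cost in dollars (y).
ages = [6, 12, 18, 24, 30, 36]
costs = [50, 75, 100, 175, 200, 300]

n = len(ages)
x_bar = sum(ages) / n
y_bar = sum(costs) / n

# Deviation form (Table 8.3.1).
sxy_dev = sum((x - x_bar) * (y - y_bar) for x, y in zip(ages, costs))
sxx_dev = sum((x - x_bar) ** 2 for x in ages)

# Computing form (Table 8.3.2).
sxy_short = sum(x * y for x, y in zip(ages, costs)) - n * x_bar * y_bar
sxx_short = sum(x * x for x in ages) - n * x_bar ** 2

b = sxy_dev / sxx_dev          # slope of the least-squares line
intercept = y_bar - b * x_bar  # estimate of beta_0, from y-hat = (y-bar - b*x-bar) + b*x

print(f"b = {b:.2f}, intercept = {intercept:.2f}")
```

Both forms give the same numerator (5100) and denominator (630). Note that the unrounded slope is 5100/630, which the text rounds to 8.10.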


STEP 4. How good is this predictive model?

We recall that we had 43,750 squared units of variation left over (i.e.,
unexplained) after fitting the model \(Y_i = \mu + E_i\) and obtaining the fit (8.3.1),
\(\hat{y} = 150\). The question to be asked is "how much of the 43,750 squared units
has been explained by model (8.3.2) for which the fit by (8.3.3) is
\(\hat{y}_i = 150 + 8.10(x_i - 21)\)?" Let us calculate the fitted value \(\hat{y}_i\) for each \(x_i\) in the data
set. The details are worked out in Table 8.3.3.


TABLE 8.3.3

          \(x_i\)    \(y_i\)    \(\hat{y}_i = 150 + 8.10(x_i - 21)\)    \(y_i - \hat{y}_i\)    \((y_i - \hat{y}_i)^2\)
            6         50         28.5                                     21.5                   462.25
           12         75         77.1                                     -2.1                     4.41
           18        100        125.7                                    -25.7                   660.49
           24        175        174.3                                      0.7                     0.49
           30        200        222.9                                    -22.9                   524.41
           36        300        271.5                                     28.5                   812.25
Totals    126        900        900                                        0                   2,464.30




Thus we see that of the 43,750 squared units, only 2464.30 are left over
(unexplained). We can say that fitting model (8.3.2) to the data explains

\[ \frac{43{,}750 - 2464.30}{43{,}750} \times 100 = \frac{41{,}285.70}{43{,}750} \times 100 = 94.4 \text{ percent} \]

of the variation in y remaining after applying \(\bar{y}\) to the data (see Figure 8.3.6).

FIGURE 8.3.6
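The computation of Table 8.3.3 and the 94.4 percent figure can be reproduced as follows. This sketch deliberately uses the rounded slope b = 8.10, as the text does; the variable names are ours.

```python
ages = [6, 12, 18, 24, 30, 36]
costs = [50, 75, 100, 175, 200, 300]

x_bar, y_bar = 21, 150
b = 8.10  # slope rounded to two decimals, as in the text

# Fitted values from y-hat = y-bar + b(x - x-bar), then residuals.
fitted = [y_bar + b * (x - x_bar) for x in ages]
residuals = [y - f for y, f in zip(costs, fitted)]

residual_ss = sum(r ** 2 for r in residuals)            # unexplained
corrected_ss = sum((y - y_bar) ** 2 for y in costs)     # left after the mean
percent_explained = 100 * (corrected_ss - residual_ss) / corrected_ss

print(f"residual SS = {residual_ss:.2f}")
print(f"slope explains {percent_explained:.1f} percent of what the mean left over")
```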
STEP 5. Using the above procedure to fit a line to y data influenced by a
characteristic x, we have advanced from the simple model
\[ Y_i = \mu + E_i, \qquad i = 1, 2, \ldots, n, \]
to the better-fitting model based on (8.3.2):
\[ Y_i = \mu + \beta(x_i - \bar{x}) + E_i, \qquad i = 1, 2, \ldots, n, \tag{8.3.4} \]
or
\[ Y_i = \beta_0 + \beta x_i + E_i, \qquad i = 1, 2, \ldots, n, \tag{8.3.4} \]
where \(\beta_0 = \mu - \beta\bar{x}\). In this model the random variation is contributed by the
random variables \(E_1, E_2, \ldots, E_n\). These are considered to be independent
random variables, each having mean zero and variance \(\sigma^2\). This tells us that
\(Y_1, Y_2, \ldots, Y_n\) are independent random variables such that

\[ \text{mean of } Y_i = E(Y_i) = \beta_0 + \beta x_i, \]
\[ \text{variance of } Y_i = \sigma^2 \text{ for every } i. \]




Our estimate of the mean \((\beta_0 + \beta x)\) is precisely the least-squares fit (8.3.3):
\[ \hat{y}_i = \bar{y} + b(x_i - \bar{x}). \]

The true population mean line
\[ y = \beta_0 + \beta x \]


is called the regression line of y on x. The least-squares line (8.3.3) is then 
often referred to as the estimated regression line of y on x. In the sense of 
these terms, the relationship of y to x is called linear regression. 


We can now summarize our findings in Table 8.3.4, called an Analysis of
Variance (ANOVA) table. In this table \(s^2\) = residual variance is a valid estimate
of \(\sigma^2\), the unknown population variance. The \(\sqrt{s^2} = s\) is an estimate of the
population standard deviation, \(\sigma\).


TABLE 8.3.4 Analysis of Variance

                          Degrees of     Sum of                          Mean Square
Source of                 Freedom        Squares                         (Sum of Squares ÷ d.f.)
Variation                 (d.f.)         (S.S.)                          (M.S.)
Total (crude)             n = 6          \(\sum y_i^2\) = 178,750
Sample mean               1              \(n\bar{y}^2\) = 135,000
Total (corrected
  for the mean)           n - 1 = 5      43,750
Slope (regression)        1              41,285.70                       41,285.70
Residual                  4              2,464.30                        616.075 = \(s^2\)

In our example,
\[ s^2 = 616.075, \qquad s = 24.82. \]
Notice that the sum of squares for residual (2,464.30 in the above example) is
exactly the measure of unexplained variation which is worked out in the
manner of Table 8.3.3. Thus we have the meaningful formula:

\[ s^2 = \frac{\sum (y_i - \hat{y}_i)^2}{n - 2}. \tag{8.3.5} \]




Also, it can be shown by algebraic manipulations that

[S.S. for total (corrected for the mean)] = [S.S. for regression] + [S.S. for residual]

is expressible as
\[ \sum (y_i - \bar{y})^2 = b \sum (x_i - \bar{x})(y_i - \bar{y}) + \sum (y_i - \hat{y}_i)^2. \tag{8.3.6} \]
It is often more convenient computationally to use (8.3.6) to compute first the
sum of squares for regression and then get by subtraction the sum of squares
for residual.
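The subtraction route suggested by (8.3.6) can be sketched in Python as follows. One assumption to flag: here we keep the slope unrounded (5100/630), so the residual sum of squares comes out a hair below the 2,464.30 that the text obtains with b rounded to 8.10. The variable names are ours.

```python
import math

ages = [6, 12, 18, 24, 30, 36]
costs = [50, 75, 100, 175, 200, 300]

n = len(ages)
x_bar = sum(ages) / n
y_bar = sum(costs) / n

sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(ages, costs))
sxx = sum((x - x_bar) ** 2 for x in ages)
b = sxy / sxx  # unrounded slope

total_corrected_ss = sum((y - y_bar) ** 2 for y in costs)
regression_ss = b * sxy                            # first term on the right of (8.3.6)
residual_ss = total_corrected_ss - regression_ss   # obtained by subtraction

residual_df = n - 2
s_squared = residual_ss / residual_df   # residual mean square
s = math.sqrt(s_squared)                # estimate of sigma

print(f"regression SS = {regression_ss:.2f}")
print(f"residual SS   = {residual_ss:.2f} on {residual_df} d.f.")
print(f"s^2 = {s_squared:.3f}, s = {s:.2f}")
```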


8.3.4 a. Using the format in Table 8.3.1, determine b, the slope of the best-fitting 
straight line to the data of Exercise 8.3.2, repeated here for your convenience. 


х 
< 


ьа 
2 о Фо мч 
Onnu =e 


b. Write the equation of best fit as
\[ \hat{y} = \bar{y} + b(x - \bar{x}) \]
c. When x = 0, determine the value of \(\hat{y}\). This is called the intercept.


8.3.5 Using the following data, determine the slope b and the intercept of the 
best-fitting straight line for predicting yield as a function of pressure. (Use the 
format in Table 8.3.2.) 


x y 
(10s of millimeters of mercury) (Yield in 1000s of pounds) 


о 
лљ»—> о 




8.3.6 Using the results of Exercise 8.3.4 above: 


a. Calculate \(\hat{y}_i\) (the predicted value of y) for each x value in the data.

b. Calculate \(y_i - \hat{y}_i\) for each of the points in the data set.

c. Do you see any patterns in these residuals that would make you reject the
straight line as a good prediction equation?


8.3.7 Using the data and results of Exercise 8.3.5 above: 
a. Calculate the predicted value of y for each value of x in the data set. 


b. Calculate the residuals \(y_i - \hat{y}_i\) for each data point.
c. Do you see any patterns in these residuals? If so, what do you think of the
straight line as a predictor?


8.3.8 The following data table was given for fitting a straight line to a set of data. The
best-fitting straight line was \(\hat{y} = 10 - 0.82(x - 7)\):

          \(x_i\)   \(y_i\)   \(y_i - \bar{y}\)   \(\hat{y}_i\)   \(y_i - \hat{y}_i\)   \((y_i - \hat{y}_i)^2\)   \(y_i^2\)   \(x_i - \bar{x}\)   \((x_i - \bar{x})^2\)   \((x_i - \bar{x})(y_i - \bar{y})\)
            1       14         4                  14.92           -0.92                 0.8464                    196         -6                  36                      -24
            3       15         5                  13.28            1.72                 2.9584                    225         -4                  16                      -20
            5       11         1                  11.64           -0.64                 0.4096                    121         -2                   4                       -2
            7       10         0                  10.00            0                    0                         100          0                   0                        0
            9        9        -1                   8.36            0.64                 0.4096                     81         +2                   4                       -2
           11        5        -5                   6.72           -1.72                 2.9584                     25         +4                  16                      -20
           13        6        -4                   5.08            0.92                 0.8464                     36         +6                  36                      -24
Totals     49       70         0                  70.00            0.00                 8.4288                    784          0                 112                      -92


a. Using the format shown in Table 8.3.4, construct the analysis of variance 
summary for this model. 

b. Calculate \(s^2\), the estimate of the unknown variance \(\sigma^2\).

c. Calculate s, the standard deviation.

d. Using the formula shown in Step 4, how much of the variation in y is
explained by fitting the line's slope to the data? The formula is written as
follows:

\[ \frac{\text{Sum of squares due to slope}}{\text{Total (corrected for mean) sum of squares}} \times 100 \text{ percent} \]


e. Do you think this straight line is a good prediction equation for these data? 


The regression model (8.3.4) is a better model than the simple \(Y_i = \mu + E_i\) if
and only if \(\beta\) is not zero. Our estimate of \(\beta\) is the value b computed as the
slope of the least-squares line. We shall want to test whether this value is
"significantly different" from zero. The logic of Chapter 6 will apply; we shall
need a few new details.

Should we decide that b is significantly different from zero, then we will want
to use the least-squares line (8.3.3) as a predictor for Y:

\[ \hat{y} = \bar{y} + b(x - \bar{x}). \]


The logic of Chapter 5 will guide us in formulating confidence intervals for 
estimation. We shall be careful about necessary details. 


The theory that we want to apply requires that the data come from a 
population having a normal distribution, and we now make that assumption. 


Specifically, we assume that the observations \(y_i\) are a random sample of
observations from normal distributions whose true means fall on the true line:
mean of \(Y = \beta_0 + \beta x\), and for which the standard deviation is \(\sigma\) at every value
of x. This can be shown graphically as follows:

Thus, after fitting our least-squares line to the data, the square root of the
residual mean square, \(\sqrt{s^2} = \sqrt{616.075} = 24.82\), is an estimate of the \(\sigma\) shown
in the above figure.


8.4 TEST OF HYPOTHESIS ABOUT β, THE SLOPE OF THE POPULATION
REGRESSION LINE


We can now state our hypothesis about the slope \(\beta\) of the true underlying
line:

\[ H_0: \beta = 0 \quad \text{versus} \quad H_A: \beta \neq 0 \]

Test of \(H_0: \beta = 0\) versus \(H_A: \beta \neq 0\) can now proceed as in Chapter 6. Let us
take \(\alpha\) = level of significance at 5 percent.
If \(H_0\) is true, then
\[ \frac{b - 0}{s / \sqrt{\sum (x_i - \bar{x})^2}} = t \text{ with 4 d.f.} \]
Reject \(H_0\) if \(t \le -2.776\) or if \(t \ge +2.776\); accept \(H_0\) otherwise. From the sample
data, giving the results shown in Section 8.3, we have
\[ t = \frac{8.10 - 0}{24.82 / \sqrt{630}} = \frac{8.10}{24.82 / 25.10} = \frac{8.10}{0.9888} = 8.19. \]

Reject \(H_0\). The observed regression line slope is significantly different from
zero, at the 5 percent level of significance.

The conclusion reached here (observed slope significantly different from
zero) is often expressed as the observed regression is significant.

As in other tests of significance, we can state the descriptive level of
significance P. Here it is

\[ P = 2P(t \ge 8.19 \mid 4 \text{ d.f.}); \qquad .001 < P < .01. \]
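The test statistic is easy to recompute. The sketch below takes b, s, and the sum of squared x deviations from Section 8.3 and hard-codes the tabled critical value 2.776 quoted in the text (we do not compute it from the t distribution here); the variable names are ours.

```python
import math

b = 8.10            # estimated slope (Section 8.3)
s = 24.82           # residual standard deviation from the ANOVA table
sxx = 630           # sum of squared deviations of x about its mean
t_critical = 2.776  # tabled t for 4 d.f., 5 percent two-sided, as quoted in the text

s_b = s / math.sqrt(sxx)   # estimated standard deviation of the slope
t_stat = (b - 0) / s_b     # hypothesized slope is zero
reject_h0 = abs(t_stat) >= t_critical

print(f"s_b = {s_b:.4f}, t = {t_stat:.2f}, reject H0: {reject_h0}")
```

The last decimal place of the standard deviation of the slope can differ slightly from the text's .9888, which was obtained after rounding the square root of 630 to 25.10; the value of t is unaffected to two decimals.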


8.5 INTERVAL ESTIMATE FOR β


If the slope is not zero, what is it? Let us make a 95 percent confidence
statement about the true slope, \(\beta\).

Now that we've concluded the true slope is not zero, our best point
estimate of \(\beta\) is b = $8.10/month. Using procedures we've learned in
Chapter 5, let's place confidence limits on \(\beta\). Recall the general form of
the confidence interval when \(\sigma^2\) is unknown,

\[ \left(\text{parameter estimate}\right) - t_\alpha \left(\text{standard deviation of the parameter estimate}\right) < \text{parameter being estimated} < \left(\text{parameter estimate}\right) + t_\alpha \left(\text{standard deviation of the parameter estimate}\right) \]

where \(t_\alpha\) is the chosen value of t in the t distribution that defines the region
of interest.

For our interval estimate of \(\beta\), the true slope, we have:

\[ b - t_\alpha s_b < \beta < b + t_\alpha s_b \tag{8.5.1} \]

To place 95 percent confidence limits on \(\beta\), we choose \(t_\alpha = 2.776\). (This must
be the same t used for the critical region in the test of hypothesis at the 5
percent level of significance. This will ensure consistent results.) The 95
percent confidence interval for \(\beta\) is then:
\[ 8.10 - (2.776)(.9888) < \beta < 8.10 + (2.776)(.9888) \]
\[ 8.10 - 2.74 < \beta < 8.10 + 2.74 \]
\[ 5.36 < \beta < 10.84 \tag{8.5.2} \]

Note that this interval does not include zero. If it did, we could not have
rejected the null hypothesis, namely \(H_0: \beta = 0\).
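Interval (8.5.1) amounts to one multiplication and two additions. The helper below is a hypothetical illustration of our own (the name `slope_interval` does not come from the text); it uses the values of b, the standard deviation of the slope, and t quoted above.

```python
def slope_interval(b, s_b, t_value):
    """Confidence limits b -/+ t * s_b for the true slope, as in (8.5.1)."""
    margin = t_value * s_b
    return b - margin, b + margin

# Values quoted in the text: b = 8.10, s_b = .9888, t = 2.776 (4 d.f., 95 percent).
lower, upper = slope_interval(b=8.10, s_b=0.9888, t_value=2.776)

print(f"{lower:.2f} < beta < {upper:.2f}")
print("interval excludes zero:", lower > 0 or upper < 0)
```

Because the same t value serves both the test and the interval, the interval excludes zero exactly when the two-sided test rejects the null hypothesis of zero slope.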




EXERCISES 


8.5.1 Using the data in Exercise 8.3.7 and the calculations obtained:

a. Use the residuals \(y_i - \hat{y}_i\) to obtain an estimate of \(\sigma^2\). (Hint: use format in
Table 8.3.3 for the formula (8.3.5) for \(s^2\).)

b. Determine how many degrees of freedom there are in estimating \(\sigma^2\). Explain
the reasoning for this answer.

c. Calculate the estimate of the standard deviation of the slope, \(s_b\), using
equation (8.4.2).

d. Using \(\alpha\) = 5 percent, test the hypothesis that the true underlying slope \(\beta\) is
really zero, that is, \(H_0: \beta = 0\) versus the alternative \(H_A: \beta \neq 0\). [Hint: use
formula (8.4.3).]

e. State why you used a t test rather than a z test.

8.5.2 In Exercise 8.3.8, the residual standard deviation is s = 1.30. Using this figure
and additional information obtained in Exercise 8.3.8, place 95 percent confidence
limits on the true unknown slope \(\beta\). Based on this interval estimate, would
you reject \(H_0: \beta = 0\) and accept \(H_A: \beta \neq 0\) with \(\alpha\) = 5 percent?

a. What are the degrees of freedom for s?
b. What value of \(t_\alpha\) did you use? How did you find it?
c. What was the value of \(\sum (x_i - \bar{x})^2\)? Where is this figure used?


8.6 PREDICTING THE AVERAGE RESPONSE AT A GIVEN VALUE OF x,
SAY \(x_k\)

If you desire an average response at a particular value of x, say \(x_k\), the point
estimate is given by substituting the particular value \(x_k\) in the equation of the
best-fitting least-squares line:

\[ \hat{y}_k = \bar{y} + b(x_k - \bar{x}). \tag{8.6.1} \]


For example, suppose we want to predict the average maintenance cost per
month for a car 24 months old. Then in

\[ \hat{y} = 150 + 8.10(x - 21) \]

we take \(x = x_k = 24\), and obtain
\[ \hat{y}_k = 150 + (8.10)(24 - 21) \]

giving the final result
\[ \hat{y}_k = \$174.30 \tag{8.6.2} \]


In obtaining this predicted value \(\hat{y}_k\), we used the least-squares equation
\(\hat{y} = \bar{y} + b(x - \bar{x})\). In this equation are two estimates, \(\bar{y}\) and b. Each of these
estimates was obtained using a sample of six cars between 6 and 36 months of
age. One realizes that the particular \(\bar{y}\) and b obtained from these six cars are
just one set of possible results. If we were to get another sample of cars
covering the same age span and assuming the linear relationship relating
maintenance cost and age to be correct, we would obtain another set of
estimates \(\bar{y}\) and b. Thus we are saying that \(\bar{y}\) has a sampling variance and so
does b. Using both these estimates in predicting an average result at \(x = x_k\)
means that the predicted value \(\hat{y}_k\) has a variance that includes them both.


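We now write down the variance of \(\hat{y}_k\) as follows:
\[
\begin{aligned}
\operatorname{Var}(\hat{y}_k) &= \operatorname{Var}[\bar{y} + b(x_k - \bar{x})] \\
&= \operatorname{Var}(\bar{y}) + (x_k - \bar{x})^2 \operatorname{Var}(b) \\
&= \frac{\sigma^2}{n} + (x_k - \bar{x})^2 \frac{\sigma^2}{\sum_{i=1}^{n} (x_i - \bar{x})^2} \\
&= \sigma^2 \left[ \frac{1}{n} + \frac{(x_k - \bar{x})^2}{\sum (x_i - \bar{x})^2} \right]
\end{aligned}
\tag{8.6.3}
\]

Some of this should look familiar: \(\operatorname{Var}(\bar{y}) = \sigma^2 / n\) comes from as far back as
Chapter 5 in our study; the value of Var(b) appeared recently above, in
(8.4.1). Why everything gets put together in the manner shown is another
matter, and a matter whose proof goes beyond what is either convenient or
instructive in this introduction to statistical inference. So, as you have already
done on a number of occasions, take our word for the accuracy of the stated
formula.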


Since we do not know \(\sigma^2\), we will substitute our best estimate of \(\sigma^2\), namely
\(s^2\), the residual mean square obtained in the ANOVA table following the
fitting of the least-squares line. We will put a hat (^) on the Var to show that
we are estimating that variance, and set down the practical formula for our use:

\[ \widehat{\operatorname{Var}}(\hat{y}_k) = s^2 \left[ \frac{1}{n} + \frac{(x_k - \bar{x})^2}{\sum (x_i - \bar{x})^2} \right] \tag{8.6.4} \]

From this of course we have the estimated standard deviation (standard error)
of \(\hat{y}_k\) as the square root:

\[ s_{\hat{y}_k} = \text{estimated standard deviation of } \hat{y}_k = s \sqrt{ \frac{1}{n} + \frac{(x_k - \bar{x})^2}{\sum (x_i - \bar{x})^2} } \tag{8.6.5} \]




In (8.6.2) we reached the predicted value for the mean maintenance cost of a
car 24 months old: \(\hat{y}_k = \$174.30\). If we will now go back through our various
calculations and take what we need, we can work out the estimated standard
deviation of the prediction according to (8.6.5):

\[
\begin{aligned}
\text{Estimated standard deviation of } \{\hat{y}_k \mid x_k = 24\} &= 24.82 \sqrt{ \frac{1}{6} + \frac{(24 - 21)^2}{630} } \\
&= 24.82 \sqrt{ 0.1667 + \frac{9}{630} } \\
&= 24.82 \sqrt{ 0.1667 + 0.0143 } \\
&= 24.82 \sqrt{0.1810} \\
&= 24.82 (0.4254) \\
&= 10.56
\end{aligned}
\]


We are now ready to put confidence limits around our predicted mean value.
These have the same form which we have used repeatedly: estimator plus-minus
so many estimated standard errors. In the present situation, the confidence
interval is the following:

\[ \hat{y}_k - t_\alpha s_{\hat{y}_k} < \text{true mean } Y \text{ at } x = x_k < \hat{y}_k + t_\alpha s_{\hat{y}_k} \tag{8.6.6} \]

where
\[ \hat{y}_k = \bar{y} + b(x_k - \bar{x}), \]
\[ s_{\hat{y}_k} = s \sqrt{ \frac{1}{n} + \frac{(x_k - \bar{x})^2}{\sum (x_i - \bar{x})^2} }, \]
\[ s = \sqrt{\text{residual mean square in ANOVA}}, \]
\[ t_\alpha = \text{the confidence-limit value of } t \text{ with } n - 2 \text{ d.f.} \]

In our example we have, from earlier calculations:
\[ x_k = 24; \quad \hat{y}_k = 174.30; \quad s_{\hat{y}_k} = 10.56; \quad n = 6 \]

For a 95 percent confidence interval, we take t from the table of t with
6 - 2 = 4 d.f., getting \(t_\alpha = 2.776\), and so have the confidence limits

\[ 174.30 \pm 2.776(10.56) = 174.30 \pm 29.31, \]
giving the 95 percent confidence interval as

\[ 144.99 < \text{true mean cost at 24 months} < 203.61, \]

or, more reasonable to report,
\[ \$145 < \text{true mean cost at 24 months} < \$204. \]
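Formulas (8.6.1), (8.6.5), and (8.6.6) can be bundled into one small function. The function name and its keyword defaults are our own illustrative choices; the default values are simply the quantities already worked out for the car data, and t = 2.776 is the tabled value quoted in the text.

```python
import math

def mean_response_interval(x_k, *, y_bar=150.0, b=8.10, x_bar=21.0,
                           s=24.82, n=6, sxx=630.0, t_value=2.776):
    """Point estimate, standard error, and confidence limits for the mean of Y at x_k."""
    fitted = y_bar + b * (x_k - x_bar)                            # (8.6.1)
    std_error = s * math.sqrt(1 / n + (x_k - x_bar) ** 2 / sxx)   # (8.6.5)
    margin = t_value * std_error
    return fitted, std_error, fitted - margin, fitted + margin   # (8.6.6)

fitted, std_error, lower, upper = mean_response_interval(24)
print(f"y-hat = {fitted:.2f}, standard error = {std_error:.2f}")
print(f"{lower:.2f} < true mean cost at 24 months < {upper:.2f}")
```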




Notice in formula (8.6.3) for the variance of the predicted mean value \(\hat{y}_k\)
that the variance of the slope b is always multiplied by \((x_k - \bar{x})^2\). That multiplier
is zero if \(x_k = \bar{x}\), and gets bigger and bigger as \(x_k\) is taken farther and farther
away from \(\bar{x}\). In other words, the farther you go from \((\bar{x}, \bar{y})\) to predict the
response, the more variation or wobble due to the slope will affect the
prediction. Consider Figure 8.6.1.

FIGURE 8.6.1

If the variation in the slope is depicted by two lines intersecting at \((\bar{x}, \bar{y})\), then the
farther you go from \(\bar{x}\) in either direction, the larger the effect of the variation
in slope.


Let's now calculate the standard error of the predicted value of the mean
maintenance cost for each of the values of \(x_i\) in the data of our example. The
following table gives the results. We have the values of \(\hat{y}_i\) from Table 8.3.3 and
set up the computation of \(\widehat{\operatorname{Var}}(\hat{y}_i)\) from (8.6.4) put into the convenient form:

\[ \frac{s^2}{n} + \frac{s^2 (x_k - 21)^2}{\sum (x_i - \bar{x})^2} = \frac{616.075}{6} + \frac{616.075}{630}(x_k - 21)^2 = 102.68 + 0.9779(x_k - 21)^2 \]

From Table 8.6.1 we can quickly put 95 percent confidence limits around
each \(\hat{y}_i\), applying (8.6.6) and taking \(t_\alpha = 2.776\) as we did before in our example.
(See Table 8.6.2.)

TABLE 8.6.1

\(x_i\)   \(x_i - \bar{x}\)   \(y_i\)   \(\hat{y}_i\)   \(\widehat{\operatorname{Var}}(\hat{y}_i)\)          \(s_{\hat{y}_i}\)
  6       -15                  50        28.5           102.68 + 0.9779(-15)^2 = 322.71                       17.96
 12        -9                  75        77.1           102.68 + 0.9779(-9)^2  = 181.89                       13.49
 18        -3                 100       125.7           102.68 + 0.9779(-3)^2  = 111.48                       10.56
 24        +3                 175       174.3           102.68 + 0.9779(+3)^2  = 111.48                       10.56
 30        +9                 200       222.9           102.68 + 0.9779(+9)^2  = 181.89                       13.49
 36       +15                 300       271.5           102.68 + 0.9779(+15)^2 = 322.71                       17.96




TABLE 8.6.2

                                                                                95 percent Confidence Limits
\(x_i\)   \(y_i\)   \(\hat{y}_i\)   \(s_{\hat{y}_i}\)   \(t_\alpha \cdot s_{\hat{y}_i}\)   Lower      Upper
  6        50        28.5           17.96               49.9                                 0*        78.4
 12        75        77.1           13.49               37.4                                39.7      114.5
 18       100       125.7           10.56               29.3                                96.4      155.0
 24       175       174.3           10.56               29.3                               145.0      203.6
 30       200       222.9           13.49               37.4                               185.5      260.3
 36       300       271.5           17.96               49.9                               221.6      321.4

*Lower bound must be zero for these data.


Plotting these confidence limits on the graph with the best-fitting straight 
line, we have Figure 8.6.2. 


Notice how the confidence band widens as we move farther and farther from
\(x = \bar{x} = 21\). Outside the range of x that we have had in our actual data (here
6-36) even the very wide confidence band is undependable, and we must
follow the scientist's general rule: extrapolation beyond the range of your data
is extremely dangerous. Take care!

FIGURE 8.6.2
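The widening of the band is easy to see numerically. The sketch below recomputes the standard errors of Table 8.6.1 from (8.6.5); the constant and function names are ours, and the inputs are the values of s, n, the mean of x, and the sum of squared x deviations from the example.

```python
import math

S, N, X_BAR, SXX = 24.82, 6, 21.0, 630.0  # values from the car-maintenance example

def std_error_of_mean_response(x_k):
    """Estimated standard deviation of the predicted mean at x_k, formula (8.6.5)."""
    return S * math.sqrt(1 / N + (x_k - X_BAR) ** 2 / SXX)

ages = [6, 12, 18, 24, 30, 36]
std_errors = {x: std_error_of_mean_response(x) for x in ages}

for x in ages:
    print(f"x = {x:2d}   standard error = {std_errors[x]:.2f}")

# The band is narrowest at the mean of x and symmetric about it.
narrowest = std_error_of_mean_response(X_BAR)
print(f"at x = {X_BAR:.0f}: {narrowest:.2f}")
```

The same function can be evaluated outside the range 6 to 36, but, as the text warns, the result there should not be trusted: the formula assumes the straight line continues to hold, which the data cannot confirm.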


8.7 PREDICTING THE NEXT OBSERVATION AT A GIVEN VALUE OF x,
SAY \(x_k\)


In the last section we dealt with the prediction of the mean response at a 
particular value of x. In our example we raised the question, “for cars 24 months 
of age, what is our prediction of mean maintenance cost, and what is a 95 
percent confidence limit on that prediction?" Our answer to this used the 
regression analysis worked out thus far. Using the observed data and assuming 
that maintenance costs were linearly related to the age of cars, our prediction 
for the mean 6-month maintenance costs for cars 24 months old was $174.30. 
Our 95 percent confidence statement was that the true mean 6-month mainte- 
nance bill for cars 24 months old is between $145 and $204. 


The natural question to ask is, “while that's not so bad for average mainte- 
nance costs, what about my particular car?" In other words, what can we say 
about an individual car or observation? Obviously when we start concerning 
ourselves about a single observation (here, a car), our point estimate will be the 
same but the car-to-car maintenance cost variability for cars of the same age is 
much bigger. Let's look at the following graph (Figure 8.7.1) predicting \(\hat{y}_k\)
when \(x = x_k = 24\) months.

FIGURE 8.7.1 (panels (a) and (b), each drawn at x = 24)


In Figure 8.7.1a we have the predicted value, \(\hat{y}_k = \$174.30\) when \(x_k =
24\) months. From the preceding section we know that the estimate \(\hat{y}_k\) has a
variance based on the straight-line model; that is, its variance has two components,
one due to estimating the overall mean by \(\bar{y}\), and the other due to
estimating the slope \(\beta\) by use of b. Hence our prediction of the regression line
is subject to variability, and in Figure 8.6.2 we saw the shape of a confidence
band for it. All this has to do with just the mean response at a given value of x.
In Figure 8.7.1b we see an additional variance component. This component is
due to the distribution of single observations about the regression line when
\(x_k = 24\).




Thus if we are to make a confidence interval statement about the next single
observation, say at \(x_k = 24\), we will need to add a component for the individual
variance to the components due to mean and slope. Thus the variance of the
next observation taken at \(x_k\) is found by adding \(\sigma^2\) to the variance in (8.6.3):

\[ \operatorname{Var}(\text{next observation at } x_k) = \sigma^2 \left[ 1 + \frac{1}{n} + \frac{(x_k - \bar{x})^2}{\sum (x_i - \bar{x})^2} \right] \tag{8.7.1} \]

Substituting \(s^2\) for \(\sigma^2\) and taking the square root, we get the estimate of the
standard deviation (standard error) of the next observation at \(x = x_k\):

\[ \text{Estimate of the standard deviation of the next observation at } x = x_k \;=\; s \sqrt{ 1 + \frac{1}{n} + \frac{(x_k - \bar{x})^2}{\sum (x_i - \bar{x})^2} } \tag{8.7.2} \]

Note how this compares with (8.6.5), the estimated standard deviation of the
mean response at \(x_k\): here we add 1 to the quantity under the square-root sign.

From this we then set up the confidence interval for the next individual
observation on Y:

\[ \hat{y}_k - t_\alpha s \sqrt{ 1 + \frac{1}{n} + \frac{(x_k - \bar{x})^2}{\sum (x_i - \bar{x})^2} } < \text{next observation on } Y \text{ at } x = x_k < \hat{y}_k + t_\alpha s \sqrt{ 1 + \frac{1}{n} + \frac{(x_k - \bar{x})^2}{\sum (x_i - \bar{x})^2} } \tag{8.7.3} \]
where the notation has the same meaning as in (8.6.6).


In our example on car maintenance costs for car age 24 months, we had
earlier:

\[ \hat{y}_k = 174.30, \qquad s_{\hat{y}_k} = 24.82\sqrt{0.1810} = 24.82(0.4254) = 10.56 \]




To predict the 6-month maintenance cost of a specific individual car 24 months
old, we again take the point estimate as \(\hat{y}_k = 174.30\). But now we use (8.7.2)
and (8.7.3). The standard error is:

\[ 24.82\sqrt{1 + 0.1810} = 24.82\sqrt{1.1810} = 24.82(1.087) = 26.98, \]
and then the limits of a 95 percent confidence interval are
\[ 174.30 \pm (2.776)(26.98) = 174.30 \pm 74.90, \]

giving the 95 percent confidence interval:

\[ 99.40 < \left\{ \text{next observation on 6-month maintenance cost of car 24 months old} \right\} < 249.20 \]
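The only change from the mean-response calculation is the extra 1 under the square root. A sketch, again with a hypothetical function name of our own and the example's values as defaults:

```python
import math

def next_observation_interval(x_k, *, y_bar=150.0, b=8.10, x_bar=21.0,
                              s=24.82, n=6, sxx=630.0, t_value=2.776):
    """Point estimate, standard error, and limits for a single new Y at x_k."""
    fitted = y_bar + b * (x_k - x_bar)
    # (8.7.2): note the leading 1, the variance of an individual observation.
    std_error = s * math.sqrt(1 + 1 / n + (x_k - x_bar) ** 2 / sxx)
    margin = t_value * std_error
    return fitted, std_error, fitted - margin, fitted + margin   # (8.7.3)

fitted, std_error, lower, upper = next_observation_interval(24)
print(f"standard error = {std_error:.2f}")
print(f"{lower:.1f} < next 6-month cost for a car 24 months old < {upper:.1f}")
```

Carrying full precision through the square root gives a standard error a shade under the text's 26.98, which was obtained from the rounded factor 1.087; the limits agree with the text to the nearest tenth.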


The 95 percent confidence limits on the next observation at each of the x
values given by the \(x_i\) of our data are recorded in Table 8.7.1, where also
the limits in Table 8.6.2 have been repeated for comparison.

TABLE 8.7.1

                                     Standard      Standard          95 percent Confidence Limits
                                     Deviation     Deviation         True Mean            Next
                                     of Mean       of Next           Response             Observation
\(x_i\)   \(y_i\)   \(\hat{y}_i\)    Response      Observation       low      high        low       high
  6        50        28.5            17.96         30.68               0*      78.4         0*      113.7
 12        75        77.1            13.49         28.25              39.7    114.5         0*      155.5
 18       100       125.7            10.56         26.98              96.4    155.0        50.8     200.6
 24       175       174.3            10.56         26.98             145.0    203.6        99.4†    249.2†
 30       200       222.9            13.49         28.25             185.5    260.3       144.5     301.3
 36       300       271.5            17.96         30.68             221.6    321.4       186.3     356.7

* As in Table 8.6.2, zero cost must be taken as lowest possible bound.

† Calculations for this interval are shown in the text above.


As we expected, the confidence intervals on the next observation are much 
wider than those on the mean response, reflecting the additional variance 
component due to the distribution of individual responses about the true 
regression line. 


8.8 ANALYSIS OF RESIDUALS


Let's consider what we've done up to this point in this chapter. We started
out by using a simple predictive model \(\hat{y} = \bar{y}\) or \(\hat{y} = 150\). Then we noted that
the deviations \(y_i - 150\) were closely related to the values of an independent or
predictive variable x. So we then expanded our model through least squares
and obtained the predictive model \(\hat{y}_i = 150 + 8.10(x_i - 21)\). We found that this
model explained 94.4 percent of the remaining residual variation. This straight-line
model is a good one. To reinforce this, using a t test we rejected the null
hypothesis that \(\beta\), the true slope, was equal to zero. Then in order to use the
model for prediction, we calculated confidence intervals for both the mean
response at a given value of x and for the next observation at a given value of
x.

Even though everything we have done has led to evidence of an excellent
linear prediction model, we should still make sure that there exists no evidence
that this good model could be further improved. As we did in Step 2, Section
8.3, let us look at the residuals in Table 8.3.3, and also divide them by the
residual standard deviation \(\sqrt{s^2}\) from the ANOVA Table 8.3.4, and retabulate
them (Table 8.8.1).

TABLE 8.8.1

\(x_i\)   \(y_i\)   \(\hat{y}_i\)   \(y_i - \hat{y}_i\)   \((y_i - \hat{y}_i)/\sqrt{s^2}\)
  6        50        28.5            21.5                   21.5/24.82 =   .87
 12        75        77.1            -2.1                   -2.1/24.82 =  -.08
 18       100       125.7           -25.7                  -25.7/24.82 = -1.04
 24       175       174.3             0.7                    0.7/24.82 =   .03
 30       200       222.9           -22.9                  -22.9/24.82 =  -.92
 36       300       271.5            28.5                   28.5/24.82 =  1.15


There are no discernible patterns in the residuals, and, further, none of the
standardized residuals is close to \(\pm 1.96\). With this additional evidence we
should now feel much more secure in using the predictive model,

\[ \hat{y} = 150 + 8.10(x - 21). \]


The final lesson then is: no matter how statistically significant any model seems 
to be, always look at the residuals after fitting the model. Then, and only then, 
should one be secure with a predictive model. 
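The residual check of Table 8.8.1 takes only a few lines. This sketch uses the rounded slope 8.10 and s = 24.82 from the text; the 1.96 yardstick is the one mentioned above, and the variable names are ours.

```python
ages = [6, 12, 18, 24, 30, 36]
costs = [50, 75, 100, 175, 200, 300]

x_bar, y_bar, b = 21, 150, 8.10  # fitted line from the text
s = 24.82                        # residual standard deviation from the ANOVA table

fitted = [y_bar + b * (x - x_bar) for x in ages]
residuals = [y - f for y, f in zip(costs, fitted)]
standardized = [r / s for r in residuals]

for x, r, z in zip(ages, residuals, standardized):
    flag = "  <-- look at this one" if abs(z) >= 1.96 else ""
    print(f"x = {x:2d}   residual = {r:6.1f}   standardized = {z:5.2f}{flag}")
```

With these data no line is flagged, and the signs of the residuals alternate without any steady trend in x, which is the absence of pattern the text describes.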




EXERCISES 


8.8.1 Using the table of calculations shown in Exercise 8.3.8 and the best-fitting
straight line \(\hat{y} = 10 - 0.82(x - 7)\):

a. Determine the standard deviation of the predicted value \(\hat{y}_k\) when \(x_k = 7\)
(Hint: use format in Table 8.6.1).

b. Determine the standard deviation of the predicted value \(\hat{y}_k\) when \(x_k = 13\).

c. Why is the value of \(s_{\hat{y}_k}\) larger when \(x_k = 13\) than when \(x_k = 7\)?

8.8.2 Using the same table of calculations shown in Exercise 8.3.8,

a. Calculate \(s_{\hat{y}_k}\) when \(x_k = 1\).
b. Using format of Table 8.6.2, construct 95 percent confidence limits on the
average response of y, using a linear model, at the three points where
\(x_k = 1\), \(x_k = 7\), and \(x_k = 13\).
c. What value of \(t_\alpha\) did you use? Why?
d. Plot the 95 percent confidence limits on a graph relating y and x.

8.8.3 For the same calculations in Exercise 8.8.2, add the next columns such that one
obtains the 95 percent confidence limits on the next observation.

8.8.4 Using the data in Exercise 8.3.2 (also used in Exercises 8.3.4 and 8.3.6) and
using the format of Tables 8.6.1, 8.6.2, and 8.7.1, construct the 95 percent
confidence limits on the next observation for each value of \(x_i\) in the data set.
(Hint: in order to do this, you'll need many of the results previously calculated.)

8.8.5 Calculate the standardized residuals for the fit of the data in Exercise 8.3.4,
using the format shown in Table 8.8.1.

a. Are there any discernible patterns in the data?
b. What are your final conclusions about the fitting of the linear model to these
data?

8.8.6 In Exercise 8.3.7, take the residuals obtained in this exercise and standardize
them as indicated in Table 8.8.1.

a. Does this added information reinforce your opinion of the usefulness of the
linear model?
b. What is your opinion about the utility of this linear prediction equation?


8.9 SUMMARY EXERCISES


8.9.1 During the time period 1958-1961, the shipments of the machinery and
equipment industries to customers began to rise. The quarterly shipments
during this period are shown below:

                            Shipments
                            (billions of 1957-1959
Quarter                     dollars)
x                           y
January-March, 1958:   1    7.5
                       2    7.4
                       3    7.5
                       4    7.3
                       5    8.0
                       6    8.3
                       7    8.5
                       8    8.4
                       9    8.4
                      10    8.6
                      11    8.4
                      12    8.1
                      13    8.0
                      14    8.2
                      15    8.5

a. Fit a linear model of the form $y_i = \beta_0 + \beta x_i + \varepsilon_i$ to these data.
b. Test the hypothesis $H_0\colon \beta = 0$ versus $H_a\colon \beta \neq 0$, using $\alpha = 5$ percent. Draw
tentative conclusions.
c. How good a fit is this model?
d. After finding the best-fitting straight line, calculate the residuals from the fit.
e. Are you satisfied with this model? Do you think the model could be
improved? Why, or why not?


8.9.2 The U.S. Coast Guard is responsible for dealing with the oil spilled in the
harbors of the U.S. Under normal conditions (no major accidents), oil in the 
harbor is caused by leakages, and discharges from ships, plants, and so on. 
Much of the oil that can be detected by eye comes from unknown sources, and 
much of the oil spillage is often not detected. It has been conjectured that the 
amount of oil spillage reported is a direct function of how many ships are 
boarded by the Coast Guard and inspected for oil-spillage potential (i.e., the
more you look for oil trouble, the more you find). The following data were
collected on this conjecture:

                        Average Number of      Average District
                        Ship Boardings per     Quarterly Oil Discharge
                        District by Quarter    Volume
Quarter                 x                      y
January-March, 1971     723                    86,888
April-June              625                    26,844
July-September          620                    45,975
October-December        612                    49,211
January-March, 1972     712                    88,876
April-June              565                    44,652
July-September          517                    25,610
October-December        554                    67,192

Using an $\alpha = 5$ percent significance level, do you agree or disagree with the
conjecture?


8.9.3 A test was conducted to determine the capacity of a soap-making machine for
different cooling-water temperatures. The results were as follows for a fixed
water flow rate:

Cooling-Water Soap-Production
Temperature Rate in
in degrees Fahrenheit pounds per minute
63 185 
63 170 
64 160 
64 150 
64 155 
66 150 
67 140 
69 140 
69 120 
70 115 
70 125 
72 110 


a. Draw your graphical estimate of the best regression line.
b. Calculate the regression line for the model $Y_i = \beta_0 + \beta x_i + \varepsilon_i$, using the
least-squares procedure.
c. Draw the calculated line for comparison with (a).
d. Is the regression significant at $\alpha = .05$?
e. Calculate 95 percent confidence limits on the expected (mean) response at
$x = \bar{x}$.
f. Draw conclusions.


8.9.4 The problem of the availability of physicians is a serious one. People who live
in remote or relatively unpopulated areas have difficulty reaching a physician. 
Several programs have been initiated by the government to alleviate the 
problem. One outstanding educator has suggested that establishing a medical 
school in a university well removed from the center of population would attract 
physicians to that area. The question arose, “excluding the county that includes 
a medical school, is there a relationship between the distance to a medical 
school from the county seat and the number of physicians in the county?" The 
following data are a sample of a complete set from the state of North Carolina. 


          Distance (Rounded to    Number of Physicians
          Nearest Mile) to        per 100,000
          a Medical School        Population
County    x                       y
1 84 19 
2 38 58 
3 77 58 
4 32 79 
5 45 72 
6 100 98 
7 56 110 
8 45 117 
9 28 95 
10 25 130 
11 147 138 
12 95 84 
13 82 55 
14 63 81 
15 18 169 
16 65 165 
17 10 40 
18 79 72 
19 52 16 
20 67 227 
21 77 123 
22 36 215 
23 76 100 
24 47 110 
25 48 117 


a. Fit a straight line to the data.
b. What is your estimate of the slope $\beta$?
c. What is the estimated standard deviation of the slope?
d. Test $H_0\colon \beta = 0$ against $H_a\colon \beta \neq 0$, using $\alpha = 10$ percent.
e. What are your conclusions?


8.9.5 The following data have been collected on required homework grades and on
the test performance of students in one science class in a major university. The 
instructor of the class decided to determine whether he could predict test 
performance using average required homework grades obtained prior to the 
test. 


Required Homework    Test-Performance
Average Grade        Grade
53 30 
67 33 
51 56 
58 48 
64 40 
65 39 
69 46 
62 53 
70 58 
78 55 
77 65 
84 64 
93 58 
88 68 
74 83 
93 73 
92 80 


a. Determine the best-fitting linear prediction equation of the form $\hat{y} = b_0 + bx$.
b. What is the estimated standard deviation of b?
c. Place 95 percent confidence limits on the true $\beta$.
d. What is the estimated standard deviation of the predicted mean test score
when the average homework score is $x = 70$?
e. By choosing three values of x, draw 95 percent confidence limits for the true
mean test score line.
f. Draw conclusions.


8.9.6 The following data show the amount of money (millions of dollars) invested in
public and private nonresidential construction. These data are in constant 
dollars, which eliminates the effect of price changes. 


a. Determine the slope of the best-fitting straight line relating year to public
nonresidential construction dollars. [Take year as $x = 1, 2, \ldots, 13$.]
b. Determine the slope of the best-fitting straight line relating private
nonresidential construction (in dollars) to the year.
c. What can you say about these data?
d. Draw inferences with the results you have calculated.


Year Public Private


1960 15 16 
1961 16 17 
1962 16.5 17.5 
1963 18 18 
1964 19 19 
1965 21 25 
1966 22 27 
1967 24 28 
1968 26 30 
1969 26.5 33.5 
1970 27 35 
1971 28 36 
1972 29 39 


8.9.7 “Sticky” shampoo shipments are given for months 4-23 following national
introduction.

                                y = Shipment Size
x = Months since Introduction   (thousands of cases)
 4                               43
 5                               55
 6                               73
 7                               53
 8                              101
 9                               59
10                               43
11                               52
12                               44
13                               44
14                               51
15                               53
16                               73
17                               81
18                               68
19                               50
20                               76
21                               97
22                               82
23                               53

Summary sums: $n = 20$, $\sum x = 270$, $\sum x^2 = 4310$, $\sum y = 1251$,
$\sum y^2 = 84{,}261$, $\sum xy = 17{,}505$.

a. Fit a straight line to these data.
b. What is the expected average monthly increase for shipments of “Sticky”
shampoo? Is this significant at 10 percent risk?
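The summary sums printed with the shampoo data are the ingredients of the usual shortcut formulas for a least-squares line. The sketch below is one way to check hand arithmetic and is not the book's worked solution: it recomputes the sums from the raw data and forms the corrected sums of squares and products. The variable names (`sxx`, `sxy`, and so on) are illustrative assumptions, and the significance test in part (b) is left to the reader.

```python
# Monthly shipments of "Sticky" shampoo (thousands of cases), months 4-23.
months = list(range(4, 24))
cases = [43, 55, 73, 53, 101, 59, 43, 52, 44, 44,
         51, 53, 73, 81, 68, 50, 76, 97, 82, 53]

# Raw sums, to compare with the summary sums given with the data.
n = len(months)
sum_x = sum(months)
sum_y = sum(cases)
sum_x2 = sum(v * v for v in months)
sum_y2 = sum(v * v for v in cases)
sum_xy = sum(a * b for a, b in zip(months, cases))

# Corrected sum of squares for x and corrected sum of products.
sxx = sum_x2 - sum_x ** 2 / n
sxy = sum_xy - sum_x * sum_y / n

# Least-squares slope and intercept of the line y = intercept + slope * x.
slope = sxy / sxx
intercept = sum_y / n - slope * sum_x / n

print(n, sum_x, sum_x2, sum_y, sum_y2, sum_xy)
print(f"slope = {slope:.3f}, intercept = {intercept:.2f}")
```

If the printed sums disagree with the summary sums, the data were probably keyed in incorrectly; that is worth checking before any inference is drawn from the slope.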


[PEANUTS cartoon. Dialogue: "Do you want to hear some baseball statistics,
Charlie Brown? According to my figures, as our pitcher you had an earned
run average this year of eighty runs per game! Statistics don't lie,
Charlie Brown." "No, but they sure shoot off their mouth a lot!"]


A Backward Glance


Mecca Community College students turned out to be an interesting bunch! Now
that the material in this book has been read, digested, and, hopefully,
learned, it might be interesting to you to find out something about the
students in your own school.

As an exercise, take the questions asked of the Mecca students and conduct a
study in your school. Conjecture on the results. Discuss the difficulties
encountered in gathering the data. Comment on the problems of getting a random
sample. After the data are collected, use the information you've learned about
statistics and analyze the data. How does your school compare with Mecca
Community College?

We, the authors, would welcome your data for future reference and for
improving the examples in this book. When you put your statistics together,
you may agree with Charlie Brown on the opposite page that "they sure shoot
off their mouth a lot!" Good luck.


Appendix A
TABLES


TABLE A-1   Squares and Square Roots
TABLE A-2   The Binomial Distribution
TABLE A-3   The Standard Normal Distribution
TABLE A-4   Percentiles of the t Distributions
TABLE A-5   Percentiles of the $\chi^2$ Distributions
TABLE A-6   A Short Table of Random Digits


TABLE A-1. Squares and Square Roots 


$N$   $N^2$   $\sqrt{N}$   $\sqrt{10N}$


100 10000 10.0000 31.6228
101 10201 10.0499 31.7805
102 10404 10.0995 31.9374 
103 10609 10.1489 32.0936 
104 10816 10.1980 32.2490 
105 11025 10.2470 32.4037 


106 11236 10.2956 32.5576 
107 11449 10.3441 32.7109 
108 11664 10.3923 32.8634 
109 11881 10.4403 33.0151 
110 12100 10.4881 33.1662 


111 12321 10.5357 33.3167 
112 12544 10.5830 33.4664 
113 12769 10.6301 33.6155 
114 12996 10.6771 33.7639 
115 13225 10.7238 33.9116 


116 13456 10.7703 34.0588 
117 13689 10.8167 34.2053 
118 13924 10.8628 34.3511 
119 14161 10.9087 34.4964 
120 14400 10.9545 34.6410 


121 14641 11.0000 34.7851 
122 14884 11.0454 34.9285 
123 15129 11.0905 35.0714 
124 15376 11.1355 35.2136 
125 15625 11.1803 35.3553 


126 15876 11.2250 35.4965 
127 16129 11.2694 35.6371 
128 16384 11.3137 35.7771 
129 16641 11.3578 35.9166 
130 16900 11.4018 36.0555 


131 17161 11.4455 36.1939 
132 17424 11.4891 36.3318 
133 17689 11.5326 36.4692 
134 17956 11.5758 36.6060 
135 18225 11.6190 36.7423 


136 18496 11.6619 36.8782 
137 18769 11.7047 37.0135 
138 19044 11.7473 37.1484 
139 19321 11.7898 37.2827 
140 19600 11.8322 37.4166 


141 19881 11.8743 37.5500 
142 20164 11.9164 37.6829
143 20449 11.9583 37.8153 
144 20736 12.0000 37.9473 
145 21025 12.0416 38.0789 


146 21316 12.0830 38.2099 
147 21609 12.1244 38.3406 
148 21904 12.1655 38.4708 
149 22201 12.2066 38.6005 
150 22500 12.2474 38.7298 


$\sqrt{N}$


12.2474 
12.2882 
12.3288 
12.3693 
12.4097 
12.4499 


12.4900 
12.5300 
12.5698 
12.6095 
12.6491 


12.6886 
12.7279 
12.7671 
12.8062 
12.8452 


12.8841 
12.9228 
12.9615 
13.0000 
13.0384 


13.0767 
13.1149 
13.1529 
13.1909 
13.2288 


13.2665 
13.3041 
13.3417 
13.3791 
13.4164 


13.4536 
13.4907 
13.5277 
13.5647 
13.6015 


13.6382 
13.6748 
13.7113 
13.7477 
13.7840 


13.8203 
13.8564 
13.8924 
13.9284 
13.9642 


14.0000 
14.0357 
14.0712 
14.1067 
14.1421 




$\sqrt{10N}$


38.7298 
38.8587 
38.9872 
39.1152 
39.2428 
39.3700 


39.4968 
39.6232 
39.7492 
39.8748 
40.0000 


40.1248 
40.2492 
40.3733 
40.4969 
40.6202 


40.7431 
40.8656 
40.9878 
41.1096 
41.2311 


41.3521 


40 000 
40 401 
40 804 
41 209 
41616 
42025 


42 436 
42 849 
43 264 
43 681 
44 100 


44 521 
44 944 
45 369 
45 796 
46 225 


46 656 
47 089 
47 524 
47 961 
48 400 


48841 
49 284 
49 729 
50 176 
50 625 


51076 
51529 
51984 
52 441 
52 900 


53361 
53824 
54 289 
54756 
55225 


55 696 
56 169 
56 644 
57 121 
57 600 


58 081 
58 564 
59049 
59 536 
60 025 


60516 
61 009 
61504 
62 001 
62 500 


$\sqrt{N}$


14.1421 
14.1774 
14.2127 
14.2478 
14.2829 
14.3178 


14.3527 
14.3875 
14.4222 
14.4568 
14.4914 


14.5258 
14.5602 
14.5945 
14.6287 
14.6629 


14.6969 
14.7309 
14.7648 
14.7986 
14.8324 


14.8661 
14.8997 
14.9332 
14.9666 
15.0000 


15.0333 
15.0665 
15.0997 
15.1327 
15.1658 


15.1987 
15.2315 
15.2643 
15.2971 
15.3297 


15.3623 
15.3948 
15.4272 
15.4596 
15.4919


15.5242 
15.5563 
15.5885 
15.6205 
15.6525 


15.6844 
15.7162 
15.7480 
15.7797 
15.8114 


$\sqrt{10N}$


44.7214 
44.8330 
44.9444 
45.0555 
45.1664 
45.2769 


45.3872 
45.4973 
45.6070 
45.7165 
45.8258 


45.9347 
46.0435 
46.1519 
46.2601 
46.3681 


46.4758 
46.5833 
46.6905 
46.7974 
46.9042 


47.0106 
47.1169 
47.2229 
47.3286 
47.4342 


47.5395 
47.6445 
47.7493 
47.8539 
47.9583 


48.0625 
48.1664 
48.2701 
48.3735 
48.4768 


48.5798 
48.6826 
48.7852 
48.8876 
48.9898 


49.0918 
49.1935 
49.2950 
49.3964 
49.4975 


49.5984 
49.6991 
49.7996 
49.8999 
50.0000 


(TABLE A-1, cont.) 


17.3205 
17.3493 
17.3781 
17.4069 
17.4356 
17.4642 


17.4929 
17.5214 
17.5499 
17.5784 
17.6068 


17.6352 
17.6635 
17.6918 
17.7200 
17.7482 


17.7764 
17.8045 
17.8326 
17.8606 
17.8885 


17.9165 
17.9444 
17.9722 
18.0000 
18.0278 


18.0555 
18.0831 
18.1108 
18.1384 
18.1659 


18.1934 
18.2209 
18.2483 
18.2757 
18.3030 


18.3303 
18.3576 
18.3848 
18.4120 
18.4391 


18.4662 
18.4932 
18.5203 
18.5472 
18.5742 


18.6011 
18.6279 
18.6548 
18.6815 


$N^2$   $\sqrt{N}$   $\sqrt{10N}$   $N^2$   $\sqrt{N}$
62500 15.8114 50.0000 90 000 
63001 15.8430 50.0999 90 601 
63504 15.8745 50.1996 91204 
64009 15.9060 50.2991 91809 
64516 15.9374 50.3984 92 416 
65025 15.9687 50.4975 93 025 
65536 16.0000 50.5964 93 636 
66049 16.0312 50.6952 94 249 
66564 16.0624 50.7937 94 864 
67081 16.0935 50.8920 95 481 
67600 16.1245 50.9902 96 100 
68121 16.1555 51.0882 96 721 
68644 16.1864 51.1859 97 344 
69169 16.2173 51.2835 97 969 
69696 16.2481 51.3809 98 596 
70225 16.2788 51.4782 99 225 
70756 16.3095 51.5752 99 856 
71289 16.3401 51.6720 100 489 
71824 16.3707 51.7687 101 124 
72361 16.4012 51.8652 101 761 
72900 16.4317 51.9615 102 400 
73441 16.4621 52.0577 103 041 
73984 16.4924 52.1536 103 684 
74529 16.5227 52.2494 104 329 
75076 16.5529 52.3450 104 976 
75625 16.5831 52.4404 105 625 
76176 16.6132 52.5357 106 276 
76729 16.6433 52.6308 106 929 
77284 16.6733 52.7257 107 584 
77841 16.7033 52.8205 108 241 
78400 16.7332 52.9150 108 900 
78961 16.7631 53.0094 109 561 
79524 16.7929 53.1037 110 224 
80089 16.8226 53.1977 110 889 
80656 16.8523 53.2917 111556 
81225 16.8819 53.3854 112 225 
81796 16.9115 53.4790 112896 
82369 16.9411 53.5724 113 569 
82944 16.9706 53.6656 114 244 
83521 17.0000 53.7587 114921 
84100 17.0294 53.8516 115 600 
84681 17.0587 53.9444 116 281 
85 264 17.0880 54.0370 116964 
85849 17.1172 54.1295 117 649 
86436 17.1464 54.2218 118 336 
87025 17.1756 54.3139 119025 
87616 17.2047 54.4059 119716 
88209 17.2337 54.4977 120 409 
88804 17.2627 54.5894 121104 
89401 17.2916 54.6809 121801 
90000 17.3205 54.7723 122500 


18.7083 


$\sqrt{10N}$


54.7723 
54.8635 
54.9545 
55.0454 
55.1362 
55.2268 


55.3173 
55.4076 
55.4977 
55.5878 
55.6776 


55.7674 
55.8570 
55.9464 
56.0357 
56.1249 


56.2139 
56.3028 
56.3915 
56.4801 
56.5685 


56.6569 
56.7450 
56.8331 
56.9210 
57.0088 


57.0964 
57.1839 
57.2713 
57.3585 
57.4456 


57.5326 
57.6194 
57.7062 
57.7927 
57.8792 


57.9655 
58.0517 
58.1378 
58.2237 
58.3095 


58.3952 
58.4808 
58.5662 
58.6515 
58.7367 


58.8218 
58.9067 
58.9915 
59.0762 
59.1608 


$N^2$


122 500 
123 201 
123 904 
124 609 
125316 
126 025 


126 736 
127 449 
128 164 
128 881 
129 600 


130 321 
131 044 
131 769 
132 496 
133 225 


133 956 
134 689 
135 424 
136 161 
136 900 


137 641 
138 384 
139 129 
139 876 
140 625 


141 376 
142 129 
142 884 
143 641 
144 400 


145 161 
145 924 
146 689 
147 456 
148 225 


148 996 
149 769 
150 544 
151 321 
152 100 


152 881 
153 664 
154 449 
155 236 
156 025 


156816 
157 609 
158 404 
159 201 
160 000 


$\sqrt{N}$
18.7083 
18.7350 
18.7617 
18.7883 


18.8149 
18.8414 


18.8680 
18.8944 
18.9209 
18.9473 
18.9737 


19.0000 
19.0263 
19.0526 
19.0788 
19.1050 


19.1311 
19.1572 
19.1833 
19.2094 
19.2354 


19.2614 
19.2873 
19.3132 
19.3391 
19.3649 


19.3907 
19.4165 
19.4422 
19.4679 
19.4936 


19.5192 
19.5448 
19.5704 
19.5959 
19.6214 


19.6469 
19.6723 
19.6977 
19.7231 
19.7484 


19.7737 
19.7990 
19.8242 
19.8494 
19.8746 


19.8997 
19.9249 
19.9499 
19.9750 
20.0000 


$\sqrt{10N}$


59.1608 
59.2453 
59.3296 
59.4138 
59.4979 
59.5819 


59.6657 
59.7495 
59.8331 
59.9166 
60.0000 


60.0833 
60.1664 
60.2495 
60.3324 
60.4152 


60.4979 
60.5805 
60.6630 
60.7454 
60.8276 


60.9098 
60.9918 
61.0737 
61.1555 
61.2372 


61.3188 
61.4003 
61.4817 
61.5630 
61.6441 


61.7252 
61.8061 
61.8870 
61.9677 
62.0484 


62.1289 
62.2093 
62.2896 
62.3699 
62.4500 


62.5300 
62.6099 
62.6897 
62.7694 
62.8490 


62.9285 
63.0079 
63.0872 
63.1664 
63.2456 


(TABLE A-1, cont.) 


$N$   $N^2$   $\sqrt{N}$   $\sqrt{10N}$


400 160000 20.0000 63.2456
401 160801 20.0250 63.3246 
402 161604 20.0499 63.4035 
403 162409 20.0749 63.4823 
404 163216 20.0998 63.5610 
405 164025 20.1246 63.6396 


406 164836 20.1494 63.7181 
407 165649 20.1742 63.7966 
408 166464 20.1990 63.8749 
409 167281 20.2237 63.9531 
410 168100 20.2485 64.0312 


411 168921 20.2731 64.1093 
412 169744 20.2978 64.1872 
413 170569 20.3224 64.2651 
414 171396 20.3470 64.3428 
415 172225 20.3715 64.4205 


416 173056 20.3961 64.4981 
417 173889 20.4206 64.5755 
418 174724 20.4450 64.6529 
419 175561 20.4695 64.7302 
420 176400 20.4939 64.8074 


421 177241 20.5183 64.8845 
422 178084 20.5426 64.9615 
423 178929 20.5670 65.0385 
424 179776 20.5913 65.1153 
425 180625 20.6155 65.1920 


426 181476 20.6398 65.2687 
427 182329 20.6640 65.3452 
428 183184 20.6882 65.4217 
429 184041 20.7123 65.4981 
430 184900 20.7364 65.5744 


431 185761 20.7605 65.6506 
432 186624 20.7846 65.7267 
433 187489 20.8087 65.8027 
434 188356 20.8327 65.8787 
435 189225 20.8567 65.9545 


436 190096 20.8806 66.0303 
437 190969 20.9045 66.1060 
438 191844 20.9284 66.1816 
439 192721 20.9523 66.2571 
440 193600 20.9762 66.3325 


441 194481 21.0000 66.4078 
442 195364 21.0238 66.4831 
443 196249 21.0476 66.5582 
444 197136 21.0713 66.6333 
445 198025 21.0950 66.7083 


446 198916 21.1187 66.7832 
447 199809 21.1424 66.8581 
448 200704 21.1660 66.9328 
449 201601 21.1896 67.0075 
450 202500 21.2132 67.0820 


$N$   $N^2$   $\sqrt{N}$   $\sqrt{10N}$


500 250000 22.3607 70.7107 
501 251001 22.3830 70.7814 
502 252004 22.4054 70.8520 
503 253009 22.4277 70.9225 
504 254016 22.4499 70.9930 
505 255025 22.4722 71.0634 


506 256036 22.4944 71.1337 
507 257049 22.5167 71.2039 
508 258064 22.5389 71.2741 
509 259081 22.5610 71.3442 
510 260100 22.5832 71.4143 


511 261121 22.6053 71.4843 
512 262144 22.6274 71.5542 
513 263169 22.6495 71.6240 
514 264196 22.6716 71.6938 
515 265225 22.6936 71.7635 


516 266256 22.7156 71.8331 
517 267289 22.7376 71.9027 
518 268324 22.7596 71.9722 
519 269361 22.7816 72.0417 
520 270400 22.8035 72.1110 


521 271441 22.8254 72.1803 
522 272484 22.8473 72.2496 
523 273529 22.8692 72.3187 
524 274576 22.8910 72.3878 
525 275625 22.9129 72.4569 


526 276676 22.9347 72.5259 
527 277729 22.9565 72.5948 
528 278784 22.9783 72.6636 
529 279841 23.0000 72.7324 
530 280900 23.0217 72.8011 


531 281961 23.0434 72.8697 
532 283024 23.0651 72.9383 
533 284089 23.0868 73.0068 
534 285156 23.1084 73.0753 
535 286225 23.1301 73.1437 


536 287296 23.1517 73.2120 
537 288369 23.1733 73.2803 
538 289444 23.1948 73.3485 
539 290521 23.2164 73.4166 
540 291600 23.2379 73.4847 


541 292681 23.2594 73.5527 
542 293764 23.2809 73.6206 
543 294849 23.3024 73.6885 
544 295936 23.3238 73.7564 
545 297025 23.3452 73.8241 


546 298116 23.3666 73.8918 
547 299209 23.3880 73.9594 
548 300304 23.4094 74.0270 
549 301401 23.4307 74.0945 
550 302500 23.4521 74.1620 


450 202500 21.2132 67.0820
451 203401 21.2368 67.1565 
452 204304 21.2603 67.2309 
453 205209 21.2838 67.3053 
454 206116 21.3073 67.3795 
455 207025 21.3307 67.4537 


456 207936 21.3542 67.5278 
457 208849 21.3776 67.6018 
458 209764 21.4009 67.6757 
459 210681 21.4243 67.7495 
460 211600 21.4476 67.8233 


461 212521 21.4709 67.8970 
462 213444 21.4942 67.9706 
463 214369 21.5174 68.0441 
464 215296 21.5407 68.1175 
465 216225 21.5639 68.1909 


466 217156 21.5870 68.2642 
467 218089 21.6102 68.3374 
468 219024 21.6333 68.4105 
469 219961 21.6564 68.4836 
470 220900 21.6795 68.5565 


471 221841 21.7025 68.6294 
472 222784 21.7256 68.7023 
473 223729 21.7486 68.7750 
474 224676 21.7715 68.8477 
475 225625 21.7945 68.9202 


476 226576 21.8174 68.9928 
477 227529 21.8403 69.0652 
478 228484 21.8632 69.1375 
479 229441 21.8861 69.2098 
480 230400 21.9089 69.2820 


481 231361 21.9317 69.3542 
482 232324 21.9545 69.4262
483 233289 21.9773 69.4982 
484 234256 22.0000 69.5701 
485 235225 22.0227 69.6419 


486 236196 22.0454 69.7137 
487 237169 22.0681 69.7854 
488 238144 22.0907 69.8570 
489 239121 22.1133 69.9285 
490 240100 22.1359 70.0000 


491 241081 22.1585 70.0714 
492 242064 22.1811 70.1427 
493 243049 22.2036 70.2140 
494 244036 22.2261 70.2851 
495 245025 22.2486 70.3562 


496 246016 22.2711 70.4273 
497 247009 22.2935 70.4982 
498 248004 22.3159 70.5691 
499 249001 22.3383 70.6399 
500 250000 22.3607 70.7107 




(TABLE A-1, cont.) 


302 500 
303 601 
304 704 
305 809 
306 916 
308 025 


309 136 
310 249 
311364 
312481 
313 600 


314721 
315844 
316 969 
318 096 
319 225 


320 356 
321 489 
322 624 
323 761 
324 900 


326 041 
327 184 
328 329 
329 476 
330 625 


331 776 
332 929 
334 084 
335 241 
336 400 


337 561 
338 724 
339 889 
341 056 
342 225 


343 396 
344 569 
345 744 
346 921 
348 100 


349 281 
350 464 
351 649 
352 836 
354 025 


355 216 
356 409 
357 604 
358 801 
360 000 


$\sqrt{N}$


23.4521 
23.4734 
23.4947 
23.5160 
23.5372 
23.5584 


23.5797 
23.6008 
23.6220 
23.6432 
23.6643 


23.6854 
23.7065 
23.7276 
23.7487 
23.7697 


23.7908 
23.8118 
23.8328 
23.8537 
23.8747 


23.8956 
23.9165 
23.9374 
23.9583 
23.9792 


24.0000 
24.0208 
24.0416 
24.0624 
24.0832 


24.1039 
24.1247 
24.1454 
24.1661 
24.1868 


24.2074 
24.2281 
24.2487 
24.2693 
24.2899 


24.3105 
24.3311 
24.3516 
24.3721 
24.3926 


24.4131 
24.4336 
24.4540 
24.4745 
24.4949 


74.1620 
74.2294 
74.2967 
74.3640 
74.4312 
74.4983 


74.5654 
74.6324 
74.6994 
74.7663 
74.8331 


74.8999 
74.9667 
75.0333 
75.0999 
75.1665 


75.2330 
75.2994 
75.3658 
75.4321 
75.4983 


75.5645 
75.6307 
75.6968 
75.7628 
75.8288 


75.8947 
75.9605 
76.0263 
76.0920 
76.1577 


76.2234 
76.2889 
76.3544 
76.4199 
76.4853 


76.5506 
76.6159 
76.6812 
76.7463 
76.8115 


76.8765 
76.9415 
77.0065 
77.0714 
77.1362 


77.2010 
77.2658 
77.3305 
77.3951 
77.4597 


$N$   $N^2$   $\sqrt{N}$   $\sqrt{10N}$


360 000 
361 201 
362 404 
363 609 
364 816 
366 025 


367 236 
368 449 
369 664 
370 881 
372 100 


373 321 
374 544 
375 769 
376 996 
378 225 


379 456 
380 689 
381 924 
383 161 
384 400 


385 641 
386 884 
388 129 
389 376 


390 625 


391 876 
393 129 
394 384 
395 641 
396 900 


398 161 
399 424 
400 689 
401 956 
403 225 


404 496 
405 769 
407 044 
408 321 
409 600 


410 881 
412 164 
413 449 
414 736 
416 025 


417 316 
418 609 
419 904 
421 201 
422 500 


24.4949 
24.5153 
24.5357 
24.5561 
24.5764 
24.5967 


24.6171 
24.6374 
24.6577 
24.6779 
24.6982 


24.7184 
24.7386 
24.7588 
24.7790 
24.7992 


24.8193 
24.8395 
24.8596 
24.8797 
24.8998 


24.9199 
24.9399 
24.9600 
24.9800 
25.0000 


25.0200 
25.0400 
25.0599 
25.0799 
25.0998 


25.1197 
25.1396 
25.1595 
25.1794 
25.1992 


25.2190 
25.2389 
25.2587 
25.2784 
25.2982 


25.3180 
25.3377 
25.3574 
25.3772 
25.3969 


25.4165 
25.4362 
25.4558 
25.4755 
25.4951 


77.4597 
77.5242 
77.5887 
77.6531 
77.7174 
77.7817 


77.8460 
77.9102 
77.9744 
78.0385 
78.1025 


78.1665 
78.2304 
78.2943 
78.3582
78.4219 


78.4857 
78.5493 
78.6130 
78.6766 
78.7401 


78.8036 
78.8670 
78.9303 
78.9937 
79.0569 


79.1202 
79.1833 
79.2465 
79.3095 
79.3725 


79.4355 
79.4984 
79.5613 
79.6241 
79.6869 


79.7496 
79.8123 
79.8749 
79.9375 
80.0000 


80.0625 
80.1249 
80.1873 
80.2496 
80.3119 


80.3741 
80.4362 
80.4984 
80.5605 
80.6226 


422 500 
423 801 
425 104 
426 409 
427716 
429 025 


430 336 
431 649 
432 964 
434 281 
435 600 


436 921 
438 244 
439 569 
440 896 
442 225 


443 556 
444 889 
446 224 
447 561 
448 900 


450 241 
451 584 
452 929 
454 276 
455 625 


456 976 
458 329 
459 684 
461041 
462 400 


463 761 
465 124 
466 489 
467 856 
469 225 


470 596 
471 969 
473 344 
474721 
476 100 


477 481 
478 864 
480 249 
481 636 
483 025 


484 416 
485 809 
487 204 
488 601 
490 000 


$\sqrt{N}$


25.4951 
25.5147 
25.5343 
25.5539 
25.5734 
25.5930 


25.6125 
25.6320 
25.6515 
25.6710 
25.6905 


25.7099 
25.7294 
25.7488 
25.7682 
25.7876 


25.8070 
25.8263 
25.8457 
25.8650 
25.8844 


25.9037 
25.9230 
25.9422 
25.9615 
25.9808 


26.0000 
26.0192 
26.0384 
26.0576 
26.0768 


26.0960 
26.1151 
26.1343 
26.1534 
26.1725 


26.1916 
26.2107 
26.2298 
26.2488 
26.2679 


26.2869 
26.3059 
26.3249 
26.3439 
26.3629 


26.3818 
26.4008
26.4197 
26.4386 
26.4575 


$\sqrt{10N}$


80.6226 
80.6846 
80.7465 
80.8084 
80.8703 
80.9321 


80.9938 
81.0555 
81.1172 
81.1788 
81.2404 


81.3019 
81.3634 
81.4248 
81.4862 
81.5475 


81.6088 
81.6701 
81.7313 
81.7924 
81.8535 


81.9146 
81.9756 
82.0366 
82.0975 
82.1584 


82.2192 
82.2800 
82.3408 
82.4015 
82.4621 


82.5227 
82.5833 
82.6438 
82.7043 
82.7647 


82.8251 
82.8855 
82.9458 
83.0060 
83.0662 


83.1264 
83.1865 
83.2466 
83.3067 
83.3667 


83.4266 
83.4865 
83.5464 
83.6062 
83.6660 


(TABLE A-1, cont.) 


800 640000 28.2843 89.4427 
801 641601 28.3019 89.4986 
802 643204 28.3196 89.5545 
803 644809 28.3373 89.6103 
804 646416 28.3549 89.6660 
805 648025 28.3725 89.7218 


806 649636 28.3901 89.7775 
807 651249 28.4077 89.8332 
808 652864 28.4253 89.8888 
809 654481 28.4429 89.9444 
810 656100 28.4605 90.0000 


811 657721 28.4781 90.0555 
812 659344 28.4956 90.1110 
813 660969 28.5132 90.1665 
814 662596 28.5307 90.2219 
815 664225 28.5482 90.2774 


700 490000 26.4575 83.6660 
701 491401 26.4764 83.7257 
702 492804 26.4953 83.7854 
703 494209 26.5141 83.8451 
704 495616 26.5330 83.9047
705 497025 26.5518 83.9643 


706 498436 26.5707 84.0238 
707 499849 26.5895 84.0833 
708 501264 26.6083 84.1427 
709 502681 26.6271 84.2021 
710 504100 26.6458 84.2615 


711 505521 26.6646 84.3208 
712 506944 26.6833 84.3801 
713 508369 26.7021 84.4393 
714 509796 26.7208 84.4985 
715 511225 26.7395 84.5577 


716 512656 26.7582 84.6168 
717 514089 26.7769 84.6759 
718 515524 26.7955 84.7349 
719 516961 26.8142 84.7939 
720 518400 26.8328 84.8528 


721 519841 26.8514 84.9117 
722 521284 26.8701 84.9706 
723 522729 26.8887 85.0294 
724 524176 26.9072 85.0882 
725 525625 26.9258 85.1469 


726 527076 26.9444 85.2056 
727 528529 26.9629 85.2643 
728 529984 26.9815 85.3229 
729 531441 27.0000 85.3815 
730 532900 27.0185 85.4400 


731 534361 27.0370 85.4985 
732 535824 27.0555 85.5570 
733 537289 27.0740 85.6154 
734 538756 27.0924 85.6738 
735 540225 27.1109 85.7321 


736 541696 27.1293 85.7904 
737 543169 27.1477 85.8487 
738 544644 27.1662 85.9069 
739 546121 27.1846 85.9651 
740 547600 27.2029 86.0233 


741 549081 27.2213 86.0814 
742 550564 27.2397 86.1394 
743 552049 27.2580 86.1974 
744 553536 27.2764 86.2554 
745 555025 27.2947 86.3134 


746 556516 27.3130 86.3713 
747 558009 27.3313 86.4292 
748 559504 27.3496 86.4870 
749 561001 27.3679 86.5448 
750 562500 27.3861 86.6025 


750 562500 27.3861 86.6025
751 564001 27.4044 86.6603 
752 565504 27.4226 86.7179 
753 567009 27.4408 86.7756 
754 568516 27.4591 86.8332 
755 570025 27.4773 86.8907 


756 571536 27.4955 86.9483 
757 573049 27.5136 87.0057 
758 574564 27.5318 87.0632 
759 576081 27.5500 87.1206 
760 577600 27.5681 87.1780 


761 579121 27.5862 87.2353 
762 580644 27.6043 87.2926 
763 582169 27.6225 87.3499 
764 583696 27.6405 87.4071 
765 585225 27.6586 87.4643 


766 586756 27.6767 87.5214 |816 665856 28.5657 90.3327 
767 588289 27.6948 87.5785 |817 667489 28.5832 90.3881 
768 589824 27.7128 87.6356 |818 669 124 28.6007 90.4434 
769 591361 27.7308 87.6926 |819 670761 28.6182 90.4986 
770 592900 27.7489 87.7496 |820 672400 28.6356 90.5539 


771 594441 27.7669 87.8066 |821 674041 28.6531 90.6091 
772 595984 27.7849 87.8635 |822 675684 28.6705 90.6642 
773 597 529 27.8029 87.9204 |823 677 329 28.6880 90.7193 
774 599076 27.8209 87.9773 |824 678 976 28.7054 90.7744 
775 600625 27.8388 88.0341 |825 680 625 28.7228 90.8295 


776 602 176 27.8568 88.0909 |826 682 276 28.7402 90.8845 
777 603729 27.8747 88.1476 |827 683 929 28.7576 90.9395 
778 605 284 27.8927 88.2043 |828 685 584 28.7750 90.9945 
779 606841 27.9106 88.2610 |829 687 241 28.7924 91.0494 
780 608400 27.9285 88.3176 |830 688 900 28.8097 91.1043 


781 609961 27.9464 88.3742 |831 690 561 28.8271 91.1592 
782 611 524 27.9643 88.4308 |832 692 224 28.8444 91.2140 
783 613089 27.9821 88.4873 |833 693 889 28.8617 91.2688 
784 614656 28.0000 88.5438 |834 695556 28.8791 91.3236 
785 616225 28.0179 88.6002 |835 697225 28.8964 91.3783 


786 617796 28.0357 88.6566 |836 698896 28.9137 91.4330 
787 619369 28.0535 88.7130 |837 700569 28.9310 91.4877 
788 620944 28.0713 88.7694 |838 702244 28.9482 91.5423 
789 622521 28.0891 88.8257 |839 703921 28.9655 91.5969 
790 624100 28.1069 88.8819 |840 705600 28.9828 91.6515 


791 625681 28.1247 88.9382 |841 707281 29.0000 91.7061 
792 627264 28.1425 88.9944 |842 708964 29.0172 91.7606 
793 628849 28.1603 89.0505 | 843 710649 29.0345 91.8150 
794 630436 28.1780 89.1067 |844 712336 29.0517 91.8695 
795 632025 28.1957 89.1628 |845 714025 29.0689 91.9239 


796 633616 28.2135 89.2188 |846 715716 29.0861 91.9783 
797 635209 28.2312 89.2749 |847 717409 29.1033 92.0326 
798 636804 28.2489 89.3308 |848 719104 29.1204 92.0869 
799 638401 28.2666 89.3868 | 849 720801 29.1376 92.1412 
800 640000 28.2843 89.4427 |850 722500 29.1548 92.1954




(TABLE A-1, cont.) 
$\sqrt{N}$   $\sqrt{10N}$


850 722500 29.1548 92.1954 810000 30.0000 4 950 902500 30.8221 97.4679 
851 724201 29.1719 92.2497 811810 30.0167 951 904401 30.8383 97.5192 
852 725904 29.1890 92.3038 813 604 30.0333 952 906304 30.8545 97.5705 
853 727609 29.2062 92.3580 815409 30.0500 953 908209 30.8707 97.6217 


854 729316 29.2233 92.4121 817216 30.0666 954 910116 30.8869 97.6729 
855 731025 29.2404 92.4662 819025 30.0832 955 912025 30.9031 97.7241 
856 732736 29.2575 92.5203 820836 30.0998 956 913936 30.9192 97.7753 
857 734449 29.2746 92.5743 822 649 30.1164 957 915849 30.9354 97.8264 


858 736164 29.2916 92.6283 824 464 30.1330 5 958 917764 30.9516 97.8775 
859 737881 29.3087 92.6823 826281 30.1496 А 959 919 681 30.9677 97.9285 
860 739 600 29.3258 92.7362 828 100 30.1662 960 921 600 30.9839 97.9796 
861 741 321 29.3428 92.7901 829 921 30.1828 961 923 521 31.0000 98.0306 


862 743044 29.3598 92.8440 831 744 30.1993 962 925 444 31.0161 98.0816 
863 744 769 29.3769 92.8978 833 569 30.2159 963 927 369 31.0322 98.1326 
864 746 496 29.3939 92.9516 835 396 30.2324 95. 964 929 296 31.0483 98.1835 
865 748 225 29.4109 93.0054 837 225 30.2490 965 931 225 31.0644 98.2344 
866 749 956 29.4279 93.0591 839 056 30.2655 966 933 156 31.0805 98.2853 
867 751 689 29.4449 93.1128 840 889 30.2820 967 935 089 31.0966 98.3362 
868 753 424 29.4618 93.1665 842 724 30.2985 968 937 024 31.1127 98.3870 
869 755 161 29.4788 93.2202 844 561 30.3150 969 938 961 31.1288 98.4378 


870 756 900 29.4958 93.2738 846 400 30.3315 970 940900 31.1448 98.4886 
871 758 641 29.5127 93.3274 848 241 30.3480 971 942841 31.1609 98.5393 
872 760 384 29.5296 93.3809 850 084 30.3645 96.0208 | 972 944784 31.1769 98.5901 
873 762 129 29.5466 93.4345 851 929 30.3809 96.0729 | 973 946729 31.1929 98.6408 


874 763 876 29.5635 93.4880 853 776 30.3974 96.1249 | 974 948676 31.2090 98.6914 
875 765 625 29.5804 93.5414 855 625 30.4138 96.1769 | 975 950 625 31.2250 98.7421 
876 767 376 29.5973 93.5949 857 476 30.4302 96.2289 | 976 952576 31.2410 98.7927 
877 769 129 29.6142 93.6483 859 329 30.4467 96.2808 | 977 954529 31.2570 98.8433 
878 770884 29.6311 93.7017 861 184 30.4631 96.3329 | 978 956 484 31.2730 98.8939 
879 772641 29.6479 93.7550 863041 30.4795 96.3846| 979 958441 31.2890 98.9444 
880 774400 29.6648 93.8083 864900 30.4959 96.4365| 980 960400 31.3050 98.9949 
881 776161 29.6816 93.8616 866761 30.5123 96.4883 | 981 962361 31.3209 99.0454 
882 777924 29.6985 93.9149 868624 30.5287 96.5401| 982 964324 31.3369 99.0959 
883 779689 29.7153 93.9681 870489 30.5450 96.5919 | 983 966 289 31.3528 99.1464 


884 781 456 29.7321 94.0213 872 356 30.5614 96.6437 | 984 968 256 31.3688 99.1968 
885 783 225 29.7489 94.0744 874225 30.5778 96.6954 | 985 970 225 31.3847 99.2472 


886 784 996 29.7658 94.1276 876 096 30.5941 96.7471 | 986 972 196 31.4006 99.2975 


887 786 769 29.7825 94.1807 877 969 30.6105 96.7988 | 987 974 169 31.4166 99.3479 
888 788 544 29.7993 94.2338 879 844 30.6268 96.8504 | 988 976 144 31.4325 99.3982 
889 790 321 29.8161 94.2868 881 721 30.6431 96.9020 | 989 978 121 31.4484 99.4485 


890 792 100 29.8329 94.3398 883 600 30.6594 96.9536 | 990 980 100 31.4643 99.4987 


891 793881 29.8496 94.3928 885481 30.6757 97.0052| 991 982081 31.4802 99.5490 
892 795664 29.8664 94.4458 887364 30.6920 97.0567| 992 984064 31.4960 99.5992 
893 797449 29.8831 94.4987 889249 30.7083 97.1082| 993 986049 31.5119 99.6494 
894 799236 29.8998 94.5516 891 136 30.7246 97.1597 | 994 988036 31.5278 99.6995 
895 801025 29.9166 94.6044 893025 30.7409 97.2111| 995 990025 31.5436 99.7497 


896 802816 29.9333 94.6573 894 916 30.7571 97.2625| 996 992016 31.5595 99.7998 
897 804609 29.9500 94.7101 896809 30.7734 97.3139| 997 994009 31.5753 99.8499 
898 806404 29.9666 94.7629 898704 30.7896 97.3653| 998 996004 31.5911 99.8999 
899 808201 29.9833 94.8156 900601 30.8058 97.4166| 999 998001 31.6070 99.9500 
900 810000 30.0000 94.8683 902500 30.8221 97.4679 | 1000 1000000 31.6228 100.0000 
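
Any row of this table can be checked by direct computation. The following is a minimal sketch in Python; the function name `table_row` is ours, not the book's, and it assumes that the four columns are N, N squared, the square root of N, and the square root of 10N, as the printed rows indicate.

```python
import math


def table_row(n: int) -> tuple[int, int, float, float]:
    """Return (N, N^2, sqrt(N), sqrt(10N)), roots rounded to four decimals.

    Assumption: this column layout is inferred from the printed rows.
    """
    return n, n * n, round(math.sqrt(n), 4), round(math.sqrt(10 * n), 4)


# Print a few rows for comparison with the printed table.
for n in (900, 961, 1000):
    print(table_row(n))
```

A row that disagrees with such a computation in the last decimal place or two is more likely a printing or scanning slip than a different rounding rule.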




TABLE A-2. The Binomial Distribution 
For designated values of n and p, the tabled entry gives P(Y ≤ y). 


 n   y   p=.10   p=.20   p=.25   p=.30   p=.40   p=.50

 5   0   .5905   .3277   .2373   .1681   .0778   .0312
     1   .9185   .7373   .6328   .5282   .3370   .1875
     2   .9914   .9421   .8965   .8369   .6826   .5000
     3   .9995   .9933   .9844   .9692   .9130   .8125
     4  1.0000   .9997   .9990   .9976   .9898   .9688
     5          1.0000  1.0000  1.0000  1.0000  1.0000

10   0   .3487   .1074   .0563   .0282   .0060   .0010
     1   .7361   .3758   .2440   .1493   .0464   .0107
     2   .9298   .6778   .5256   .3828   .1673   .0547
     3   .9872   .8791   .7759   .6496   .3823   .1719
     4   .9984   .9672   .9219   .8497   .6331   .3770
     5   .9999   .9936   .9803   .9527   .8338   .6230
     6  1.0000   .9991   .9965   .9894   .9452   .8281
     7           .9999   .9996   .9984   .9877   .9453
     8          1.0000  1.0000   .9999   .9983   .9893
     9                          1.0000   .9999   .9990
    10                                  1.0000  1.0000

15   0   .2059   .0352   .0134   .0047   .0005   .0000
     1   .5490   .1671   .0802   .0353   .0052   .0005
     2   .8159   .3980   .2361   .1268   .0271   .0037
     3   .9444   .6482   .4613   .2969   .0905   .0176
     4   .9873   .8358   .6865   .5155   .2173   .0592
     5   .9978   .9389   .8516   .7216   .4032   .1509
     6   .9997   .9819   .9434   .8689   .6098   .3036
     7  1.0000   .9958   .9827   .9500   .7869   .5000
     8           .9992   .9958   .9848   .9050   .6964
     9           .9999   .9992   .9963   .9662   .8491
    10          1.0000   .9999   .9993   .9907   .9408
    11                  1.0000   .9999   .9981   .9824
    12                          1.0000   .9997   .9963
    13                                  1.0000   .9995
    14                                          1.0000

20   0   .1216   .0115   .0032   .0008   .0000   .0000
     1   .3917   .0692   .0243   .0076   .0005   .0000
     2   .6769   .2061   .0913   .0355   .0036   .0002
     3   .8670   .4114   .2252   .1071   .0160   .0013
     4   .9568   .6296   .4148   .2375   .0510   .0059
     5   .9887   .8042   .6172   .4164   .1256   .0207
     6   .9976   .9133   .7858   .6080   .2500   .0577
     7   .9996   .9679   .8982   .7723   .4159   .1316
     8   .9999   .9900   .9591   .8867   .5956   .2517
     9  1.0000   .9974   .9861   .9520   .7553   .4119
    10           .9994   .9961   .9829   .8725   .5881
    11           .9999   .9991   .9949   .9435   .7483
    12          1.0000   .9998   .9987   .9790   .8684
    13                  1.0000   .9997   .9935   .9423
    14                          1.0000   .9984   .9793
    15                                   .9997   .9941
    16                                  1.0000   .9987
    17                                           .9998
    18                                          1.0000
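
For values of n and p that are not tabled, a cumulative binomial probability can be computed directly from the binomial formula. The sketch below is illustrative only; the name `binomial_cdf` is our own, and it simply adds the individual binomial probabilities for 0, 1, ..., y successes.

```python
from math import comb


def binomial_cdf(y: int, n: int, p: float) -> float:
    """Cumulative probability of at most y successes in n independent trials.

    Each term comb(n, k) * p**k * (1 - p)**(n - k) is the probability of
    exactly k successes; the terms for k = 0, ..., y are summed.
    """
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(y + 1))


# Compare with the tabled entry for n = 5, p = .10, y = 2.
print(round(binomial_cdf(2, 5, 0.10), 4))
```

Because the function adds the terms one at a time, it is suited to the modest values of n used in this table rather than to very large n.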




TABLE A-3. The Standard Normal Distribution 


Φ

.00001
.00003
.00007
.0001
.0002
.0003
.0005
.0007
.0010
.0013
.0019
.0026
.0035
.0047
.0050
.0062
.0082
.0100
.0107
.0139
.0179
.0200
.0227
.0250
.0287
.0300
.0359
.0400
.0446
.0500



TABLE A-4. Percentiles of the t Distributions 


d.f. 55 65 

1 0.158 0.510 

2 0.142 0.445 

3 0.137 0.424 

4 0.134 0.414 

5 0.132 0.408 

6 0.131 0.404 

7 0.130 0.402 

8 0.130 0.399 

9 0.129 0.398 
10 0.129 0.397 
11 0.129 0.396 
12 0.128 0.395 
13 0.128 0.394 
14 0.128 0.393 
15 0.128 0.393 
16 0.128 0.392 
17 0.128 0.392 
18 0.127 0.392 
19 0.127 0.391 
20 0.127 0.391 
21 0.127 0.391 
22 0.127 0.390 
23 0.127 0.390 
24 0.127 0.390 
25 0.127 0.390 
26 0.127 0.390 
27 0.127 0.389 
28 0.127 0.389 
29 0.127 0.389 
30 0.127 0.389 
35 0.127 0.388 
40 0.126 0.388 
45 0.126 0.388 
50 0.126 0.388 
60 0.126 0.387 
70 0.126 0.387 
80 0.126 0.387 
90 0.126 0.387 
100 0.126 0.386 
120 0.126 0.386 
140 0.126 0.386 
160 0.126 0.386 
180 0.126 0.386 
200 0.126 0.386 
∞ 0.126 0.385 


1.000 
0.816 
0.765 
0.741 
0.727 


0.718 
0.711 
0.706 
0.703 
0.700 


0.697 
0.695 


0.692 
0.691 


0.687 
0.686 


0.685 
0.685 


85 90 95 97.5 99 99.5 
1.963 3.078 6.314 12.706 31.821 63.657 
1.386 1.886 2.920 4.303 6.965 9.925 
1.250 1.638 2.353 3.182 4.541 5.841 
1.190 1.533 2.132 2.776 3.747 4.604 
1.156 1.476 2.015 2.571 3.365 4.032 
1.134 1.440 1.943 2.447 3.143 3.707 
1.119 1.415 1.895 2.365 2.998 3.499 
1.108 1.397 1.860 2.306 2.896 3.355 
1.100 1.383 1.833 2.262 2.821 3.250 
1.093 1.372 1.812 2.228 2.764 3.169 
1.088 1.363 1.796 2.201 2.718 3.106 
1.083 1.356 1.782 2.179 2.681 3.055 
1.079 1.350 1.771 2.160 2.650 3.012 
1.076 1.345 1.761 2.145 2.624 2.977 
1.074 1.341 1.753 2.131 2.602 2.947 
1.071 1.337 1.746 2.120 2.583 2.921 
1.069 1.333 1.740 2.110 2.567 2.898 
1.067 1.330 1.734 2.101 2.552 2.878 
1.066 1.328 1.729 2.093 2.539 2.861 
1.064 1.325 1.725 2.086 2.528 2.845 
1.063 1.323 1.721 2.080 2.518 2.831 
1.061 1.321 1.717 2.074 2.508 2.819 
1.060 1.319 1.714 2.069 2.500 2.807 
1.059 1.318 1.711 2.064 2.492 2.797 
1.058 1.316 1.708 2.060 2.485 2.787 
1.058 1.315 1.706 2.056 2.479 2.779 
1.057 1.314 1.703 2.052 2.473 2.771 
1.056 1.313 1.701 2.048 2.467 2.763 
1.055 1.311 1.699 2.045 2.462 2.756 
1.055 1.310 1.697 2.042 2.457 2.750 
1.052 1.306 1.690 2.030 2.438 2.724 
1.050 1.303 1.684 2.021 2.423 2.704 
1.049 1.301 1.679 2.014 2.412 2.690 
1.047 1.299 1.676 2.009 2.403 2.678 
1.045 1.296 1.671 2.000 2.390 2.660 
1.044 1.294 1.667 1.994 2.381 2.648 
1.043 1.292 1.664 1.990 2.374 2.639 
1.042 1.291 1.662 1.987 2.368 2.632 
1.042 1.290 1.660 1.984 2.364 2.626 
1.041 1.289 1.658 1.980 2.358 2.617 
1.040 1.288 1.656 1.977 2.353 2.611 
1.040 1.287 1.654 1.975 2.350 2.607 
1.039 1.286 1.653 1.973 2.347 2.603 
1.039 1.286 1.653 1.972 2.345 2.601 
1.036 1.282 1.645 1.960 2.326 2.576 


99.95 


636.619 
31.599 
12.924 

8.610 
6.869 


5.959 
5.408 
5.041 
4.781 
4.587 


4.437 
4.318 
4.221 
4.140 
4.073 


4.015 
3.965 
3.922 
3.883 
3.850 


3.819 
3.792 
3.768 
3.745 
3.725 


3.707 
3.690 
3.674 
3.659 
3.646 


3.591 
3.551 
3.520 
3.496 
3.460 


3.435 
3.416 
3.402 
3.390 
3.373 


3.361 
3.352 
3.345 
3.340 
3.291 


TABLE A-5. Percentiles of the Chi-square 


0.5 1 2.5 5 10 20 30 40 50 


1 0.00004 0.0002 0.001 0.004 0.016 0.064 0.148 0.275 0.455 
2 0.010 0.020 0.051 0.103 0.211 0.446 0.713 1.022 1.386 
3 0.072 0.115 0.216 0.352 0.584 1.005 1.424 1.869 2.366 
4 0.207 0.297 0.484 0.711 1.064 1.649 2.195 2.753 3.357 
5 0.412 0.554 0.831 1.145 1.610 2.343 3.000 3.655 4.351 


6 0.676 0.872 1.237 1.635 2.204 3.070 3.828 4.570 5.348 
7 0.989 1.239 1.690 2.167 2.833 3.822 4.671 5.493 6.346 
8 1.344 1.646 2.180 2.733 3.490 4.594 5.527 6.423 7.344 
9 1.735 2.088 2.700 3.325 4.168 5.380 6.393 7.357 8.343 
10 2.156 2.558 3.247 3.940 4.865 6.179 7.267 8.295 9.342 


11 2.603 3.053 3.816 4.575 5.578 6.989 8.148 9.237 10.341 
12 3.074 3.571 4.404 5.226 6.304 7.807 9.034 10.182 11.340 
13 3.565 4.107 5.009 5.892 7.042 8.634 9.926 11.129 12.340 
14 4.075 4.660 5.629 6.571 7.790 9.467 10.821 12.078 13.339 
15 4.601 5.229 6.262 7.261 8.547 10.307 11.721 13.030 14.339 


16 5.142 5.812 6.908 7.962 9.312 11.152 12.624 13.983 15.338 
17 5.697 6.408 7.564 8.672 10.085 12.002 13.531 14.937 16.338 
18 6.265 7.015 8.231 9.390 10.865 12.857 14.440 15.893 17.338 
19 6.844 7.633 8.907 10.117 11.651 13.716 15.352 16.850 18.338 
20 7.434 8.260 9.591 10.851 12.443 14.578 16.266 17.809 19.337 


21 8.034 8.897 10.283 11.591 13.240 15.445 17.182 18.768 20.337 
22 8.643 9.542 10.982 12.338 14.041 16.314 18.101 19.729 21.337 
23 9.260 10.196 11.689 13.091 14.848 17.187 19.021 20.690 22.337 
24 9.886 10.856 12.401 13.848 15.659 18.062 19.943 21.652 23.337 
25 10.520 11.524 13.120 14.611 16.473 18.940 20.867 22.616 24.337 


26 11.160 12.198 13.844 15.379 17.292 19.820 21.792 23.579 25.336 
27 11.808 12.879 14.573 16.151 18.114 20.703 22.719 24.544 26.336 
28 12.461 13.565 15.308 16.928 18.939 21.588 23.647 25.509 27.336 
29 13.121 14.256 16.047 17.708 19.768 22.475 24.577 26.475 28.336 
30 13.787 14.953 16.791 18.493 20.599 23.364 25.508 27.442 29.336 


35 17.192 18.509 20.569 22.465 24.797 27.836 30.178 32.282 34.336 
40 20.707 22.164 24.433 26.509 29.051 32.345 34.872 37.134 39.335 
45 24.311 25.901 28.366 30.612 33.350 36.884 39.585 41.995 44.335 
50 27.991 29.707 32.357 34.764 37.689 41.449 44.313 46.864 49.335 
60 35.534 37.485 40.482 43.188 46.459 50.641 53.809 56.620 59.335 


70 43.275 45.442 48.758 51.739 55.329 59.898 63.346 66.396 69.334 
80 51.172 53.540 57.153 60.391 64.278 69.207 72.915 76.188 79.334 
90 59.196 61.754 65.647 69.126 73.291 78.558 82.511 85.993 89.334 
100 67.328 70.065 74.222 77.929 82.358 87.945 92.129 95.808 99.334 
120 83.852 86.923 91.573 95.705 100.624 106.806 111.419 115.465 119.334 


140 100.655 104.034 109.137 113.659 119.029 125.758 130.766 135.149 139.334 
160 117.679 121.346 126.870 131.756 137.546 144.783 150.158 154.856 159.334 
180 134.884 138.820 144.741 149.969 156.153 163.868 169.588 174.580 179.334 
200 152.241 156.432 162.728 168.279 174.835 183.003 189.049 194.319 199.334 




χ² Distributions 


60 70 80 90 95 97.5 99 99.5 99.95 


0.708 1.074 1.642 2.706 3.841 5.024 6.635 7.879 12.116 1 
1.833 2.408 3.219 4.605 5.991 7.378 9.210 10.597 15.202 2 
2.946 3.665 4.642 6.251 7.815 9.348 11.345 12.838 17.730 3 
4.045 4.878 5.989 7.779 9.488 11.143 13.277 14.860 19.997 4 
5.132 6.064 7.289 9.236 11.070 12.833 15.086 16.750 22.105 5 
6.211 7.231 8.558 10.645 12.592 14.449 16.812 18.548 24.103 6 
7.283 8.383 9.803 12.017 14.067 16.013 18.475 20.278 26.018 7 
8.351 9.524 11.030 13.362 15.507 17.535 20.090 21.955 27.868 8 
9.414 10.656 12.242 14.684 16.919 19.023 21.666 23.589 29.666 9 
10.473 11.781 13.442 15.987 18.307 20.483 23.209 25.188 31.420 10 
11.530 12.899 14.631 17.275 19.675 21.920 24.725 26.757 33.137 11 


12.584 14.011 15.812 18.549 21.026 23.337 26.217 28.300 34.821 12 
13.636 15.119 16.985 19.812 22.362 24.736 27.688 29.819 36.478 13 
14.685 16.222 18.151 21.064 23.685 26.119 29.141 31.319 38.109 14 
15.733 17.322 19.311 22.307 24.996 27.488 30.578 32.801 39.719 15 


16.780 18.418 20.465 23.542 26.296 28.845 32.000 34.267 41.308 16 
17.824 19.511 21.615 24.769 27.587 30.191 33.409 35.718 42.879 17 
18.868 20.601 22.760 25.989 28.869 31.526 34.805 37.156 44.434 18 
19.910 21.689 23.900 27.204 30.144 32.852 36.191 38.582 45.973 19 
20.951 22.775 25.038 28.412 31.410 34.170 37.566 39.997 47.498 20 


21.991 23.858 26.171 29.615 32.671 35.479 38.932 41.401 49.011 21 
23.031 24.939 27.301 30.813 33.924 36.781 40.289 42.796 50.511 22 
24.069 26.018 28.429 32.007 35.172 38.076 41.638 44.181 52.000 23 
25.106 27.096 29.553 33.196 36.415 39.364 42.980 45.559 53.479 24 
26.143 28.172 30.675 34.382 37.652 40.646 44.314 46.928 54.947 25 


27.179 29.246 31.795 35.563 38.885 41.923 45.642 48.290 56.407 26 
28.214 30.319 32.912 36.741 40.113 43.195 46.963 49.645 57.858 27 
29.249 31.391 34.027 37.916 41.337 44.461 48.278 50.993 59.300 28 
30.283 32.461 35.139 39.087 42.557 45.722 49.588 52.336 60.735 29 
31.316 33.530 36.250 40.256 43.773 46.979 50.892 53.672 62.162 30 


36.475 38.859 41.778 46.059 49.802 53.203 57.342 60.275 69.199 35 
41.622 44.165 47.269 51.805 55.758 59.342 63.691 66.766 76.095 40 
46.761 49.452 52.729 57.505 61.656 65.410 69.957 73.166 82.876 45 
51.892 54.723 58.164 63.167 67.505 71.420 76.154 79.490 89.561 50 
62.135 65.227 68.972 74.397 79.082 83.298 88.379 91.952 102.695 60 


72.358 75.689 79.715 85.527 90.531 95.023 100.425 104.215 115.578 70 
82.566 86.120 90.405 96.578 101.879 106.629 112.329 116.321 128.261 80 
92.761 96.524 101.054 107.565 113.145 118.136 124.116 128.299 140.782 90 
102.946 106.906 111.667 118.498 124.342 129.561 135.807 140.169 153.167 100 
123.289 127.616 132.806 140.233 146.567 152.211 158.950 163.648 177.603 120 


143.604 148.269 153.854 161.827 168.613 174.648 181.840 186.847 201.683 140 
163.898 168.876 174.828 183.311 190.516 196.915 204.530 209.824 225.481 160 
184.173 189.446 195.743 204.704 212.304 219.044 227.056 232.620 249.048 180 


204.434 209.985 216.609 226.021 233.994 241.058 249.445 255.264 272.423 200 




TABLE A-6. A Short Table of Random Digits* 


03000 
03001 
03002 
03003 
03004 


03005 
03006 
03007 
03008 
03009 


03010 
03011 
03012 
03013 
03014 


03015 
03016 
03017 
03018 
03019 


03020 
03021 
03022 
03023 
03024 


03025 
03026 
03027 
03028 
03029 


03030 
03031 
03032 
03033 
03034 


03035 
03036 
03037 
03038 
03039 


03040 
03041 
03042 
03043 
03044 


03045 
03046 
03047 
03048 
03049 


95429 
54933 
49242 
41240 
02049 


27000 
91285 
44887 
72892 
17812 


95921 
40023 
80560 
47712 
80406 


56642 
74682 
45524 
57139 
00705 


05549 
22327 
86018 
70839 
67411 


28259 
04503 
63288 
63701 
96540 


16744 
75423 
48129 
81433 
81800 


07751 
26480 
09458 
91478 
36033 


22404 
94658 
39921 
36744 
16956 


33291 
63079 
47767 
76432 
04180 


05023 
30376 
61815 
22056 
38223 


12588 
54345 
26995 
36748 
43757 


10413 
28286 
81722 
67899 
13822 


53730 
44393 
46409 
42113 
17743 


70101 
55480 
85335 
18951 
88645 


11822 
79028 
93134 
54429 
71159 


20236 
86481 
76751 
53248 
27936 


33731 
31002 
04923 
94042 
56105 


60701 
81609 
05493 
75178 
15978 


76222 
72605 
52203 
63190 
18717 


42445 
60217 
36878 
66879 
10899 


72677 
34034 
74237 
42544 
07029 


86215 
92943 
72870 
21900 
81956 


56710 
69167 
12928 
85637 
58687 


92945 
95907 
88367 
33484 
77789 


53058 
63404 
91378 
86992 
93478 


89506 
84147 
53494 
63646 
60474 


44285 


81479 
12916 
72093 
48908 
13677 


50662 
37310 
88921 
00765 
78410 


85039 
41583 
86454 
18132 
08991 


70241 
41446 
46822 
92492 
88502 


27189 
77989 
10506 
00091 
72757 


16516 
03482 
91988 
02004 
97919 


63437 
82941 
04187 
64467 
34712 


96913 
49074 
70053 
63299 
23328 


20791 
72224 
93414 
27882 
32638 


07482 
42852 
32254 
04588 
32422 


06582 
71034 
63479 
44909 
14360 


96047 
55291 
18037 
29826 
93762 


35246 
27563 
57429 
29785 
95359 


10522 
43982 
21407 
06469 
80485 


95527 
88732 
33585 
74367 
59927 


20196 
76677 
12157 
28351 
43501 


81051 
47088 
79955 
48089 
90626 


33077 
16354 
69033 
86572 
59888 


60898 
93683 
02831 
87981 
46917 


83308 
14681 
31966 
27277 
38615 


57832 
41493 
15847 
12820 
74016 


51209 
63399 
92664 
04582 
09606 


27026 
73009 
72880 
26865 
03425 


30170 
45485 
04653 
35989 
22352 


49801 
93567 
31568 
72892 
39876 


71098 
23288 
34581 
13700 
65527 


56704 
75897 
30747 
89590 
62663 


93006 
28824 
81766 
67303 
47185 


06962 
52224 
02695 
82359 
23087 


98960 
18795 
15805 
11293 
85666 


19864 
00810 
81812 
75666 
06527 


61781 
98036 
90519 
16897 
33152 


13873 
52091 
60952 
42085 
39800 


47951 
23758 
03029 
92453 
93014 


05829 
75074 
93165 
04084 
87841 


63699 
39594 
79674 
89012 
82167 


59752 
62126 
81231 
15504 
68216 


52025 
09677 
54408 
55861 
72830 


75470 
17750 
06558 
62162 
69066 


40993 
78102 
99528 
09048 
03020 


09655 
40746 
72644 
70382 
80281 


22706 
17711 
13201 
05507 
81105 


35350 
86401 
61152 
97353 
53545 


28314 
88922 
95903 
68124 
94452 


59762 
32414 
63832 
59770 
51595 


63075 
66119 
85790 
70678 
15386 


67516 
14258 
26399 
16287 
39411 


19420 
57532 
03839 
78772 
71950 


23908 
48138 
13437 
72337 
99985 


84206 
30630 
34002 
55210 
27931 


46091 
22328 
85935 
86600 
53788 


94834 
15957 
54268 
00115 
79698 


94513 
64081 
02839 
43889 
16848 


84072 
76378 
71550 
98263 
18065 


74343 
81044 
06743 
82641 
96364 


68394 
81274 
22454 
92686 
33527 


93433 
15839 
40379 
02513 
28219 


77286 
14898 
04008 
46641 
59564 


18351 
41620 
91279 
98921 
01735 


76311 
16175 
69239 
71579 
01660 


39898 
98099 
82053 
21025 
70695 


24773 
26772 
92950 
95513 
22892 


38339 
10484 
30079 
41507 
26169 


86847 
76329 
87652 
75567 
42134 


51864 
77751 
65506 
32061 
60506 


32666 
99982 
87109 
89488 
12041 


82926 
82679 
16182 
03315 
98681 


92896 
43899 
69508 
20170 
97746 


08554 
90644 
30904 
74546 
20241 


91982 
77998 
77643 
95191 
44950 


* Reprinted by permission from RAND Corporation, A Million Random Digits, The Free Press 
(1955), p. 61. 
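
One common way of using such a table to draw a random sample can be sketched in code. This is an illustration under stated assumptions, not necessarily the procedure described in the text: members of the population are assumed to be numbered 1 to N, the digits are read in consecutive groups of fixed width, and groups that fall outside 1 to N or repeat an earlier choice are skipped. The function name `draw_sample` and its parameters are hypothetical.

```python
def draw_sample(digits: str, width: int, population: int, size: int) -> list[int]:
    """Select `size` distinct labels in 1..population from a string of random digits.

    Assumptions (ours): labels run from 1 to `population`; the digits are read
    left to right in non-overlapping groups of `width` digits; out-of-range or
    repeated groups are discarded.
    """
    chosen: list[int] = []
    for start in range(0, len(digits) - width + 1, width):
        label = int(digits[start:start + width])
        if 1 <= label <= population and label not in chosen:
            chosen.append(label)
            if len(chosen) == size:
                break
    return chosen


# Digits 95429 and 54933 are two consecutive entries read down the table.
print(draw_sample("9542954933", width=3, population=700, size=2))
```

If the digit string runs out before enough labels are found, the function returns the shorter list, so a longer stretch of the table would then be needed.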




Appendix B 


NUMERICAL 
ANSWERS 




2.2.2. Cell entries are: 


39: 27 49/9 -14.518.. (787 31 
29 21 
5 1 
Si 27 
2.2.5. a. 37, 8; b. 32 liked, 5 did not, 8 had no opinion; c. 86.5%. 
2.2.7. Cell entries are: LOIS - 129.725 
8 1 0710 279 
10 Т ОО 11 
28: 16 1560 545 
2.2.8. Cell entries аге: 3 4 9.0) 416 
O "6d. 10:93; 59017 
a 5 1240. 110 
1 0 То 12 
8 13. 21, 3 45 


a. 10; b. 3, 6.67%; c. 82.22%; d. 64.44%. 




2.3.1.                  Opinion on legalizing marijuana
Political party   Strongly    Mildly        No    Mildly  Strongly
preference        disagree  disagree   opinion     agree     agree    Totals
Democrat                19         3         2        26        10        60
Republican              17         7         3        22         1        50
Other party              1         1         1         3         0         6
No preference           12         3         2        27        20        64
Totals                  49        14         8        78        31       180


2.3.5. c. (a) 67, (b) 66, (c) 10-19. 

3.2.1. a. 423.6; b. 423; c. 426; d. 424. 

3.2.2. a. mean = 11. 

3.2.3. a. 36.20; b. 36.3; c. there is no single mode. 


3.2.4. a. mean = 102.9, median = 104, midrange = 104. 
b. 60%. c. 15%. d. 102,900; 98,760. 


3.2.5. a. mean = median = midrange = 1.2 ounces. 


3.2.6. a. 50.2. b. there is no mode; median = 45. 


3.2.7. Volume Last price Net change 
Mean 720,180 31.367 1,138 


Median 718,500 281 +14 




3.2.8. i. 4395; d $are ну dii. 16:5; iv. 47.4. 


3.2.9. a. line #1: 1.213, 1.22, 1.22; line #2: 1.244, 1.22, 1.22. 
b. 1.229. 


3.2.10. a. 12,518.2; 10,700; 10,500; 13,750. 
3.2.11. a. 14,443; b. mean = 1444.3, median = 1714.5. 
3.2.13. 18:93025; 
3.2.15. a. mean = 201 (201.2). 
3.2.16. c. 183.4; d. 183. 
3.2.17. a. 62,550; b. 54,000; e. roughly 45,000. 
3.2.20. (a) 88.4; (b) 88; (c) 88. 
3.2.21. (a) 67.02; (b) 67; (c) 67. 
3.2.22. а; 70:75 71,770: 
3.2.23. b. 20, 26.25, 20.75, 20.5, 19.5, 18.75. 
3.4.1. 12, 11.94, 3.45, 0.81%. 
3.4/2. 8, 5164, 2:32,:21:5 90: 
3.4.3. 2.5, 0.375, 0.612, 1.69%. 
3.4.4. 0.8, 0.0667, 0.258, 21.5%. 


3.4.5. Line #1: 0.16, 0.00283, 0.0532, 4.39%. 
Line #2: 0.14, 0.00223, 0.0472, 3.79%. 


3.4.6. 58, 316.37, 17.79, 28.44%. 

3.4.7. 26, 63.41, 7.96, 9.00%. 

3.4.8. 20, 6.62, 2.57, 3.83%. 

3.4.9. 37, 140.9, 11.87, 16.79%. 
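
The four figures listed for each exercise of Section 3.4 appear to be the range, the sample variance, the sample standard deviation, and the coefficient of variation. As a hedged illustration, they can be computed as follows; the function name `dispersion` is ours, and the sketch assumes the variance uses the divisor n − 1 and that the coefficient of variation is the standard deviation expressed as a percentage of the mean.

```python
from statistics import mean, stdev, variance


def dispersion(data: list[float]) -> tuple[float, float, float, float]:
    """Return (range, variance, standard deviation, coefficient of variation in %).

    Assumptions (ours): variance and standard deviation use the n - 1 divisor;
    the coefficient of variation is 100 * s / mean.
    """
    data_range = max(data) - min(data)
    s2 = variance(data)        # sample variance, divisor n - 1
    s = stdev(data)            # square root of the sample variance
    cv = 100 * s / mean(data)  # standard deviation as a percentage of the mean
    return data_range, s2, s, cv


# A small made-up data set, used only to show the call.
print(dispersion([2, 4, 4, 4, 5, 5, 7, 9]))
```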

3.5.1. b. $5,000; $2,108.21; $2,058.51; $854.80; 40.55%. 


3.5.2.
                   Sample #1   Sample #2   Sample #3   Sample #4

a. (i) male           71.11%      51.11%      62.22%      46.67%
   (ii) female        28.89%      48.89%      37.78%      53.33%

b. mean                15.60       11.48       12.52       10.49
   median              10.00       11.00        8.00        7.50
   s.d.                12.48       10.72       10.95        8.79

c. c.v.               80.00%      93.38%      87.46%      83.79%




d. Percentage agreeing to legalization: 


Sample #1 Sample #2 Sample #3 Sample #4 


Overall 51.11 62.22 71.11 55.56 
Male 43.75 69.57 85.71 57.14 
Female 69.23 54.55 47.06 54.17 
Democrat 58.82 71.43 53.33 50.00 
Republican 33.33 18.18 83.33 53.85 
Other party 0.00 — 50.00 100.00 
No party 72.73 80.00 81.25 56.25 


e. Grade point average (G.P.A.): 
Sample #1 Sample #2 Sample #3 Sample #4 


Mean 2.39 2.48 2.43 2.31 
Standard deviation 0.65 0.48 0.62 0.51 
Mean: male 2.35 2.46 2.42 2.24 
S.d.: male 0.74 0.56 0.63 0.50 
Mean: female 2.50 2.50 2.43 2.37 
S.d.: female 0.36 0.39 0.61 0.51 
Mean: 0-9.5 mi. 2.07 2.35 2.55 2.29 
10–19.5 mi. 2.68 2.64 2.28 2.49 
20-29.5 mi. 2.48 2.40 2.38 2.16 
30–39.5 mi. 2.93 — 2.14 2.37 
40-49.5 mi. — — 2.08 — 
50-59.5 mi. 3.34 3.37 — — 
4.3.1. 1/8, 3/8, 1/8. 
4.3.2. P(0 heads) = 1/16, P(1 head) = 1/4, P(2 heads) = 3/8, 
P(3 heads) = 1/4, P(4 heads) = 1/16. 
4.3.3. 1/36, 1/18, 1/12, 1/9, 5/36, 1/6, 5/36, 1/9, 1/12, 1/18, 1/36. 
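
The eleven fractions in answer 4.3.3 match the probabilities for the sum of two fair dice, from 2 through 12. Assuming that is what the exercise asks (the exercise itself is not reproduced here), the distribution can be verified by listing all 36 equally likely outcomes, as in this short sketch:

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Count how many of the 36 equally likely (first die, second die) pairs
# give each possible sum.
counts = Counter(a + b for a, b in product(range(1, 7), repeat=2))

# Convert counts to exact probabilities, ordered by the sum 2, 3, ..., 12.
probs = {total: Fraction(count, 36) for total, count in sorted(counts.items())}

for total, probability in probs.items():
    print(total, probability)
```

Exact fractions are used rather than decimals so that the results can be compared with the answer term by term.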


4.3.4. (a) 1/4, (b) 1/2, (c) 1/13, (d) 3/13, (e) 8/13 [9/13 if ace is counted as 
“one”], (f) 1/26. 


4.3.5. (a) 19/45, (b) 5/18, (c) 2/45, (d) 7/30. 
4.3.6. 21/76, 1/38, 15/76. 
4.3.7. 31/104, 8/19. 
4.3.8. 12/53, 1/2. 
4.3.9. 1/3, 7/12, 277; 
4.3.10. 2/3, 2/5. 




4.3.11. 1/3, 10/27, 8/27. 
4.5.1. (a) 1/1024, (b) 33/66640. 
4.5.2. (a) 1/256, (b) 33/16660. 


4.5.3. No pair shows independence: 29/180 vs. 13/81, 1/36 vs. 13/675, 
37/180 vs. 416/2025, 3/20 vs. 19/135, 7/60 vs. 19/162, 1/180 vs. 
19/1350, 3/20 vs. 304/2025. 


4.5.4. No: 1/90 vs. 32/2025. 


45:5. 2277: 
4.5.6. a. 0.4096, b. 0.6723, c. 0.5886. 
4.5.7. .0905. 
4.5.8. (a) .0139, (b) .0000 (less than .00005). 
4.5.9. .6778. 
4.5.10. 10. 
4.7.1. (a) .2119 (e) .8849 
(b) .9773 (f) .1151 
(c) .1587 (g) .1151 
(d) .5793 (h) .8849 
4.7.2. (a) .6710 (d) .8664 
(b) .0501 (e) .1336 
(c) .1554 (f) .0895 
4.7.3. (a) —1.282 (d) 1.282 (g) —0.674 
(b) 0.524 (e) 0 (h) –0.842 
(с) –0.524 (f) 0 (i) 1.645 
4.7.4. (a) —1.036 (d) —1.405 
(b) 0.674 (e) 2.326 
(c) 0.253 (f) —1.645 
4.7.5. (a) —1.282 < z < 1.282, (b) —2.576 < z < 2.576. 
4.7.6. (a) —0.674, 0, +0.674; (b) —0.842, —0.253, +0.253, +0.842. 
4.8.1. μ = 4.00, σ = 3.32. 
4.8.2. (a) μ = 4, σ = 5.25; (b) μ = 1.00, σ² = 8.70. 


4.8.3. μ = 2, σ² = 1. 
4.8.4. μ = 4.0000, σ² = 2.4002 (4.0, 2.4). 
4.8.5. (a) .6915, (b) .2420, (c) .0548, (d) .7881. 




4.8.6. 47, 6. 
4.8.7. a. 0.0062; b. (a) .1151, (b) .1587. 
4.8.8. (a) 1.79%, (b) 84.64%, (c) 127.3 psi, (d) 123.4, 136.6 psi. 
4.8.9. a. 82.0%; b. 68.3%, 95.5%, 99.7%; c. 1.4%, 0.3%; d. 140 or above. 
4.8.10. 10 to 18 days, 7 to 21 days. 
4.9.1. (a) .1587, (b) .0179. 
4.9.2. .7119; P(number of deaths ≤ 420 | n = 2000, p = .23) = .0179. 
4.9.3. .3805. 
4.9.4. .0359. 
4.9.5. .2742. 
4.9.6. .2119. 
4.9.7. P(846 ≤ Y ≤ 954 | Y binomial, n = 5400, p = 1/6) = .9546. 
5.3.2. (a) 343, 294, 418, 446, 471, 691. 
(b) 3275, 0188, 4118, 2591, 2889. 
(c) 044, 099, 053, 025, 513, 339, 484. 
(d) 5298, 3275, 4574, 2297, 1953, 6439, 2214, 2064. 
5.3.4. b. median = 9600; c. mean = 10,220. 
5.6.1. a. (i) 14.76 < μ < 20.24; (ii) 13.89 < μ < 21.11; (iii) 5.48, 7.22. 
b. 2.46, 3.24; c. 0.78, 1.02; d. 14. 
5.6.2. a. (i) 220.2 < μ < 239.8; b. 68. 
5.6.3. 38.02 < μ < 39.98. 
5.6.4. a. 2.91 < μ < 5.09; b. 173. 
5.6.5. 151. 
5.6.6. 666. 
5.6.7. 78. 
5.6.8. Yes: 98.32 < μ < 129.68. 
5.7.1. 68.53 < μ < 81.31. 
5.7.2. 43.19 < μ < 55.81. 
5.7.3. a. 109.26 < μ < 138.74. 
5.7.4. $52.85 < μ < $67.15. 


5273s 
5.7.6. 
2:91: 
3:9.2. 


5.9.3. 
5.9.4. 


5.9.5. 


5.9.6. 


5.9.7. 
5012.1. 
5.122. 
5.12.3. 
5.12.4. 


5.12.5, 


5.12.6. 
5,127. 


5.12.8. 
5.12.9; 


5.12.10, 


5:12:11. 


6.6.1. 


6.6.2. 


6.6.3. 


6.6.4. 
6.6.5. 




7,08 < и «132. 
62.78 < u <65.22. 
a. —L13 ja 25229. 


а. Coded: 0.17 € ua – Ha <5.41; in original units: .04017 € 
ив — Ba € 04541. 


—6.83 < pa — шв < 2.83. 
a. —1.24< pa — ue € 0.52. 


a. 576 и <14.68; b. 5.15< и <12.19; c. 12.97; 
d. —3.42<pir- иг <6.52. 


a. A 95% confidence interval for difference in mean percents of 
error is 0.26 € ив — Ha < 1.48. 


—0.29< pu- er «1 1.69 (U = untreated, T = treated). 

a. 10%; b. 2.4% « p € 17.696. 

49 < number of germinating seeds < 111 (12.2% <p < 27.8%). 
0.392 <р <0.888. 

41% « p « 15.996. 

31.1% Ср <48.9%. 

0.00040 < pa — рв < 0.05210. 

0.075 < pw — рм <0.249. 


a. 39024 by 0.6% <p< 5.4%. 


2655; 956. 
b. 84; c. 2.0% <p< 12.0%; d. 24< number defective < 144. 
a. 2%; b. 0.9% <p <3.1%; c. 160+88. 


a. (7–50)/2 = Z; reject Ho if z = —1.960 or if z = +1.960. 
b. (Y-75)1.131 = 2; reject Но if 222.326. 
с. (У – 300)/5 = Z; reject Ho if zz-1.645. 


a. (Y —50)/2 = Z; reject Ho if z = 1.645. 
b. Reject Ho if ӯ =53.290. P(accept Ho | ш= 55) = 0.1841. 


(У —90)/2.667 = Z; reject Ho if 2 =-2.326. 
P (deciding for special study | u = 80) = 0.9192. 


Evidence not sufficient: z — —1.5, P — 0.0668. 
Agree with complaint: z — 7.00, P «0.00001. 
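
The P values quoted with the z statistics above are tail areas under the standard normal curve. A minimal sketch of the calculation follows, using Python's standard `statistics.NormalDist`; the function names are ours, and which tail applies depends on the direction of the alternative hypothesis in each exercise.

```python
from statistics import NormalDist

STANDARD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def lower_tail_p(z: float) -> float:
    """Area to the left of z: the P value when small values of z favor the alternative."""
    return STANDARD_NORMAL.cdf(z)


def upper_tail_p(z: float) -> float:
    """Area to the right of z: the P value when large values of z favor the alternative."""
    return 1.0 - STANDARD_NORMAL.cdf(z)


print(round(lower_tail_p(-1.5), 4))  # compare with P = 0.0668 for z = -1.5
print(upper_tail_p(7.00) < 0.00001)  # z = 7.00 gives a P value below 0.00001
```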




6.8.1. 
6.8.2. 


6.8.3. 


6.8.4. 


6.8.5. 


6.9.1. 
6.9.2. 


6:9:3; 
6.9.4. 
6.9.5. 


6.9.6. 


6.9.7. 
6.9.8. 


6.9.9: 


6.11.1: 


6112. 


6.11.3. 
6.11.4. 


Accept Ho: 1= 1.20 (as against critical value 2.539), .10<P<.15. 


(a) =.702 or =.898, (b) =.693 or =.907, 

(c) =9.67 or 210.33, (d) =9.67 or =10.33, 

(e) «33.56 or 246.44, (f) =25.40 or 2 54.60. 

a. Evidence not sufficient: t= 1.12 (as against critical value 2.093), 
.20<Р:<.30: 

b. Decide against change: t = 1.12 (as against critical value — 1.729), 
85 « P <.90. 


a. Disagree: t = —0.534 (against critical value —2.365), .50 < P < .70. 
b. Disagree: 1— 0.814, .30 € P —.50. 

с. .024664 < u < .025212, .024857 < u < .025293. 

d. –.000180 < up — Ha < +.000454 (95 percent confidence). 

a. Not significantly less: t — —0.586 (against critical value —1.383), 


укр == 35, 
No: t= 1.33 (against critical value 2.101), P = .20). 


a. Yes: t — 2.39 (against critical value 1.711), .01 < P < .025. 
b. 14.92. 4.24< Ba – ро € 25.60. 


Yes: г= —3.02; .001 < Р<.01. 
b. No: г= —0.870 (against critical value —1.734), .30 < P —.50. 


Brands as observed differ significantly: t = 2.45 (against critical value 
2.048), Р=.02; 95 confidence interval is 0.41 < LA — ив < 4.59. 


a. Sample means do not differ significantly: t = —1.90 (against criti- 
cal value —2.179), .05<P<.10. 
b. —0.41 < us — uA < 5.99. 


Agree: t— 3.92 (against critical value 2.43), P<.0005. 


a. 4.96< u <8.38. b. Reject Ho: t=3.41 (against critical value 
3.055), .001<P<.01. c. 0.27 « и, — њ<5.07. 


a. Observed increase in means is not significant: t= 0.438 (against 
critical value 1.697), .25 < P < .35. 


Observed rate is significantly lower: 2 = —2.50 (against critical value 
—2.326), P = .0062. 


At 1% level, no; at 5% level, yes: z —2.10 (against respective 
critical values 2.576, 1.960), P = .0358. 


No: 2 — 1.22 (against critical value 1.960), P = .23. 


Observed proportion defective not significantly greater than stan- 
dard: z = 1.53 (against critical value 1.645), P = .06. 


6.11.5. 


6.11.6. 


6.11.7. 


6.11.8. 


6.119. 


ВА 
7.8.2. 


1.8.3. 
7.8.4. 


1:83. 
7.8.6. 
T:8.2. 


7.8.8. 
7.8.9. 
7.8.10. 


1,811, 


8.3.2. 
5:33. 
8.3.4. 
8:35, 
8.3.6. 
8.3.7. 




No: z = —2.17 (against critical value —1.960), P —.03. 


Observed death rate not significantly less than standard: z = –2.26 
(against critical value —2.326), P ~ .0107. [But at any level 7.0107, 
conclusion would be the opposite.] 


Yes: z —1.73 (against critical value 1.645), P—.04. [But 95% 
confidence interval is 49.4% < p < 60.6%]. 


No: 90% confidence interval for p is 45.8% <p<62.2%, 50% 
confidence interval is 50.6% <p<57.4%. 


а. Мо: 2 = 0.601 (against critical value 1.282), P=.27. 

b. 214, 53.5%. 

с. Observed preference not significantly different from 50%: z= 
1.40 (against critical value 1.960), P=.16. 


Yes: дї = 6.628 versus xj+=3.841 (.01<P<.025). 


Observed difference not significant: хї = 0.172 versus xix = 3.841 
(60 < P « 10). 


b. Accept Ho. х1 = 1.333 versus х1+ = 3.841 (.20<Р < .30). 


Evidence not sufficient to reject the hypothesis: (a) z=—1.130 
versus z= — 1.960; (b) х1 = 1.280 versus xi: = 3.841. 


Yes: x1 = 109.74 versus xi» = 3.841 (Р < .0005). 
Мо: ха = 1.905 versus xi» = 3.841. 

No: х= 9.705 versus ха“ = 11.070. 

Reject Ho. ха = 9.60 versus у] = 3.841. 
Observed association is significant: x4 = 11.663 versus xa = 9.488. 


@ 
Observed association is significant: уз= 267.81 versus X3«7 7.815. 
In reduced 2 x 2 table, observed association is significant: X1 = 159.67 
versus yi» = 3.841. 


a. Significantly different: х5 = 58.439 versus 43991. 
b. Significantly different: ха = 40.646 versus x3+= 5.991. 


b. ¥=9.0, y =4.0. 
b. ¥=9.0, У =3.0. 
а. 12; b. 9=44+1.2G—9), 658 


0.60, –2.4. 
a. 1.6, 2.8, 4.0, 5.2, 64; b. —0.6, 0.2, 1.0, -0.2, -04. 


a. 1.8, 24, 3.0, 3.6, 4.2; b. 1.2, -0.4, –2.0, 0.4, 0.8. 




8.3.8. 


8.541. 


8:5:2. 
8.8.1. 
8.8.2. 


8.8.3. 
8.8.4. 
8.8.5. 
8.8.6. 
89.1: 


8.92. 
8.9.3. 


8.9.4. 


8.9.5. 


8.9.6. 
58:917. 


a. 


b. 


a. 
d. 


d.f. S.S. M.S. 


Total (crude) T 784 
Sample mean 1 700 
Total (corrected for mean) 6 84 
Slope [Regression] 1 75.5712 75.5712 
Residual 5 8.4288 1.6858 


1.6858; c. 1.30; d. 89.97%. 


2.13: b. 3: c. 0.462; 
accept Ho (t = 1.299 versus ty = 3.182). 


—ld4« «50. атбагар 2971: но 112. 


а. 


а. 
C. 


0.491; b. 0.886. 


0.886; b. 12.64 and 17.20, 8.74 and 11.26, 2.80 and 7.36. 
2.5 71. 


10.88, 18.96; 6.43, 13.57; 1.04, 9.12. 


0, 


4.5; 0,2, 5:45 1,5, 6:520:07 7:82 3:5 9,30 


—0.82, 0:27; 1.37; —0/27, 20:55. 
0,82, =0:27. —1.37, 0:27.10:55. 


a. 
b. 
d. 


у =7.60+0.0629х. 

Reject Ho (t=3.51 versus |; = 2.160) 

—16,. —.32, —:29,'— 452 109 1339346, -30:- 24; .37;..11, —25, 
—.41, —.28, —.04. 


Agree: hypothesis test rejects В —0 (t — 2.609 versus t4 —2.447). 
b. у = 592.56—6.73х. 


d. 
е. 


а. 
b. 


yes: t — —8.00 versus t4 = —2.228. 
137.65, 149.01. 


ý = 106.07 —0.0695x. 
=0:0695: ке: 03651; "а: accept Но (t=—0.190 versus t= 
= ТЫ) 


. ў = 0.0334+0.7661х. 
. 0.2147; с. 0.3086< 8 < 1.2236; d. 2.9080. 


» 1.267 b:2.06. 
· у =50.04+0.927х; b. 927 cases, not significant (t= 1.38 versus 


ty = 1.734). 


Index 


Acceptance of a hypothesis, 212 
Acceptance region, 219 
Analysis of variance, 271 
Arithmetic mean, 60 
Association, 243, 246 


Bar chart, 38 
segmented, 42 
Bernoulli trials, 117 
Best fit, 267 
Bimodal distribution, 64 
Binomial distribution, 120 
normal approximation, 149 


Central limit theorem, 165 
Centrality, measures of, 59 


Chi-square test of association, 243, 247 
binomial proportion, 236 
c proportions, 239 
homogeneity, 250, 252 
independence, 243, 247 
shift in binomial proportion, 255 


Coding data, 4, 8, 34—35 
Coefficient of variation, 85 
Confidence, 167 


Confidence interval, binomial proportion p, 198 
difference between two proportions, p₁ − p₂, 202 
difference μ₁ − μ₂ when σ₁, σ₂ are known, 184 
difference μ₁ − μ₂ when σ₁, σ₂ are unknown, 189 
mean μ when σ is known, 169 
mean μ when σ is unknown, 177 
mean response in regression, 279 
next observation in regression, 283 
slope of regression line, 276 
Contingency table, 242, 247 
Continuous, 11 
correction for continuity, 150 
random variable, 126 
Correction for continuity, 150 
Countably infinite, 11 
Critical region, 210 
Cumulative frequency polygon, 47 
Cumulative percentage polygon, 47 


Data, 8 

categorical, 8 
with ranking, 8 

continuous, 11 
counting, 10 
discrete, 9 
interval-scale, 12 
measurement, 8 
nominal, 9 
ordinal, 10 
qualitative, 234 
ratio-scale, 13 
summary of kinds, 14 

Decile, 144 




Decision rule, 211 
Degrees of freedom, 83, 175 
in chi-square tests, 238 
Descriptive level of significance, 213 
Deviation, standard, see Standard 
deviation 
Discrete, 9 
random variable, 133 
Dispersion, 81 
Distribution, binomial, 120 
chi-square, 236 
Gaussian, 127 
general normal, 139 
normal, 139 
standard normal, 127 
Student's t, 175 
χ², 236 


Estimate of binomial proportion p, 195 
binomial proportion p (pooled), 230 
intercept of regression line, 268 
population mean μ, 164 
population standard deviation σ, 174 
slope of regression line, 268 
variance (pooled), 188 

Estimation, 155 
and testing compared, 218 

Events, 104 
compound, 113 
conditional, 113 
independent, 113, 115 
intersection, 113 
mutually exclusive, 104 

Expectation, 133 

Expected value, 133 


Factor, 182 
Frequency polygon, 46 
Frequency table, 29, 90 


Gambling odds, 111 
Gaussian distribution, 127 
Geometric mean, 65 


Harmonic mean, 65 
Histogram, 43 
Hypothesis, test of, 212 


Independence, 113, 115 

test of, 243, 247 
Independent events, 113 
Independent repeated trials, 116 
Inequalities, rules on, 141 
Inference, 101 
Intercept, 265 

estimate of, 268 
Interval scale, 12 
Intervals, choice of, 28 


Least squares, 267 

Level of a factor, 182 

Level of significance, 209 
descriptive, 213 


Mean, 60, 133 
arithmetic, 60 
geometric, 65 
harmonic, 65 
of binomial distribution, 136 
of probability distribution, 133 
of random variable, 133 
of sample, 163 
of sample mean, 164 
of sample proportion, 197 
Median, 62 
of random variable, 143 
Midrange, 64 
Mode, 64 


Nominal data, 9 

Normal approximation to binomial, 149 
Normal distribution, 127, 139 

Null hypothesis, 209 


Observations, 8, 138 
Odds, 110 
One-tailed test, 214 


Ordinal data, 10


Parameter, 121, 132, 238 

Pattern of chance, 124 

Percentile, 144 

Pie chart, 40 

Polygon, frequency, 46 
cumulative frequency, 47 
cumulative percentage, 47 


Pooled estimate of population proportion, 230
of variance, 188 
Population, 100, 138 
Prediction of mean response in regression, 277
next observation in regression, 282 
Probability, 103 
conditional, 113 
density, 126 
function, 133 
Probability density function, 126 


Quartile, 144 
Quintile, 144 


Random, 103 

Random digits, 158 

Random numbers, 159 

Random sample, 156 
method for drawing, 158 

Random variable, 117 
binomial, 120 
continuous, 126 
discrete, 133 

Range, 81 

Ratio scale, 13 

Regression, 271 

Rejection, nonrejection, 212 

Residuals, analysis of, 285 
sum of squares for, 271 

Response, 182 


Sample, 8, 100, 138 




Sample mean, 60, 163 
Sample proportion, 195 
Sample size, 8 
required, 170, 199 
Sample standard deviation, 85, 177 
Sample variance, 83 
Segmented bar chart, 42 
Significance, descriptive level of, 213 
level of, 209 
test of, 212 
Skewed distribution, 63 
Slope, 265 
confidence interval for, 276 
estimate of, 268 
test of hypothesis concerning, 275 
Squares, sum of, see Sum of squares 
Standard deviation, 85, 135 
of binomial distribution, 136 
of estimated slope in regression, 275 
of next observation in regression, 283 
of predicted mean in regression, 278 
of probability distribution, 135 
of random variable, 135 
of sample mean, 164 
of sample proportion, 197 
Standard error, difference of two sample means, 183, 189
estimated slope in regression, 275 
next observation in regression, 283 
predicted mean in regression, 278 
sample mean, 170, 178 
sample proportion, 198 
difference of two sample proportions, 201
Standard normal distribution, 127 
Statistic, 58 
Statistical inference, 101 
Statistics, 2 
descriptive, 101 
Student's t distribution, 175 
Sum of squares, 88 
due to the mean, 263 
for regression, 271 




for residual, 271 
for slope, 271 


t distribution, 175 
Table composition, 26 
Table of frequency, 29, 90 
Test, binomial proportion p, 228 
difference between means, μ1 - μ2, 222
difference between two proportions, p1 - p2, 230
of hypothesis, 212
mean μ when σ is known, 209
mean μ when σ is unknown, 216
of significance, 212 
Testing and estimation compared, 218 
Two-tailed test, 214 


Universe, 100 


Variability, measures of, 81 
in a population, 102 

Variance, 82, 135 
of binomial distribution, 136 
of probability distribution, 135 
of random variable, 135 
residual, 271 

Variation, 81 
coefficient of, 85 


χ² test of association, 243, 247
binomial proportion, 236 
c proportions, 239 
homogeneity, 250, 252 
independence, 243, 247 
shift in binomial proportion, 255


ISBN 0-471-50928-0 


